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CONFIGURATIONS DEFINED BY THETA FUNCTIONS 
By Artuur B. CoBLEe 


The aspects of theta function theory which I wish to present today have 
come to my own attention in connection with an effort to develop an invariant 
theory for the Cremona group in a projective space which would in some sense 
parallel the invariant theory of the projective group in that space. One type 
of such a theory arises in the plane in connection with the idea of a set of n 
points P*, as the carrier of the totality =%, of all the complete linear systems of 
curves which can be defined by their multiplicities at the points of the set. 
This duality between P%, and >%, is obviously invariant under projective trans- 
formation, i.e., if P*, is carried into P;? by a projectivity 7, then 2? defined by 
P*, is carried by x into = defined by P,’. But it is also true that if P% and 
P’? are such that n — p pairs p; , p; drawn from the two sets are corresponding 
pairs of a Cremona transformation 7 for which the remaining p points in each 
set are the direct and inverse F-points of 7, then also 2%, defined by P%, is carried 
by 7 into =,” defined by P:”. Under these circumstances we say that the set 
P*, is congruent to the set P,” under the Cremona transformation 7 and regard 
this notion of congruence of sets of points P%, as the extension to the Cremona 
group of the notion of projectivity of sets of points. In this notion of con- 
gruence, as in projectivity, the order of the points in the related sets P*, P.? 
is obviously material. 

When we extend this notion to sets of points P* in spaces of higher dimen- 
sion k, it is necessary to confine the Cremona transformations to elements of the 
“Tegular’’ Cremona group, i.e., to transformations which can be defined by 
‘Ssolated” F-points—transformations which are termed “punctual”? by Miss 
Hudson. 

In developing this notion of congruence, we find it convenient to eliminate 
projectivity by using the obvious canonical form in which the first k + 2 points 
of P*, are taken to be the reference points, and the unit point and the factors of 
proportionality in the codrdinates of the remaining n — k — 2 points are 
so adjusted that the last codérdinate of each is the same, say uw. Then the 
set P* is uniquely determined by the codrdinates of a point P in a space 
Skin—x-2), 


P: Weider jet +--+ tu (¢,2’,--- =k+3,---,n3j,j,--- 2 1,---,k). 


The ratios x,;;:u are then absolute projective invariants or double ratios deter- 
mined by P*,. Naturally this mapping of sets P% in S; on points P of 
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Sicn—k-2) has many “‘singular” or F-loci corresponding to “‘special’’ sets P*, , 
but generically it is birational. 

A first inquiry with respect to this mapping concerns the variation of P in- 
duced by a change in the order of the points of PS. This change is repre- 
sented by a Cremona group G,; in Sxcn—z-2) , the n! points P in a conjugate set 
of G,, representing the n! orders in a P* which corresponds to one of the n! 
points P. The invariant theory of this G,, is essentially the projective invariant 
theory of P. regarded as a symmetric set of n points in S;. It is thus one 
rather immediate extension of the theory of invariants of a single binary form. 

A second and more important inquiry with respect to this mapping concerns 
the variation of P induced by the transition from P* to a congruent P’*. If 
we use the theorem that the regular Cremona group in S, is generated by 
collineations and the single regular transformation of type 22; = 1 (¢ = 1, --- , 
k + 1), then it is necessary to add to G,,; only one additional generator to 
obtain the new type of variation. This additional generator has the very 
simple form 2;;7;; = 1, wu’ = 1. With G,, it generates a Cremona group G,,, 
in Sxcn—z-2) Which is, except in a few instances to be noticed later, infinite and 
discontinuous. <A conjugate set of points P under G,,, represents the aggregate 
of sets P*, congruent in some order to one ordered set in the aggregate. 

We may say that the group G,,x in Sxcn—x-2) , a group without absolute con- 
stants, represents the capacity of the regular Cremona transformations in S, 
with no more than n F-points to distort the projective properties of sets of no 
more than n points. 

The nature of the group G,,, may be studied with comparative ease through its 
isomorphism with a linear group. In the case G,.2 let the set of points P?, be on a 
cuspidal cubic C*, 2, = ¢°, x. = t, x3 = 1, with parameters t,, --- ,¢,. The cusp 
ist = o, the flex is ¢ = 0, and the collinear condition is s; + se + s; = 0. Under 
the quadratic transformation Aj; with F-points at t; , f2 , ts the cubic C’ is trans- 
formed into a cuspidal cubic C”, the point p(t) of C* going into the point p’(t) of 
c”. OnC”, however, the collinear condition is s; + s: + 3 +t ++ = 0. 
If then on C” we make the change of parameter t’ = t + 3(t; + te + ts) so as 
to restore the form si + ss + 8; = 0 of the collinear condition, the cubic c 
can be projected back upon C’ in such a way that p’(t’) falls on p(t’). But the 
inverse F-points on C” of A253 are those whose parameters ¢ are respectively 
— (te + ts), —(t; + 4), —(4 + &). Hence the linear transformation 


. t=t+Uht+bht+h)—(h+b+h) (= 1,2,3), 
123 + 

= t+ hh tht b) (j = 4,---,n) 
furnishes that set of points t; , --- , t, on C*® which is congruent to the original 
set of points t; , --- , t, on C* under Ay. The reordering of P*, being obviously 
equivalent to a permutation of 4; , --- ,t,, we see that the group G,.2 is simply 
isomorphic with the collineation group on f, , --- , f. generated by the permuta- 
tions of t; , --- , 4, and the element L123. We say the collineation group rather 


than the linear group since the transformation t’ = pt represents a collineation 
P ° 3: ° 2. ° ° ° 12 
which carries C’ into itself and P;, into a projectively equivalent P,, . 
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For the set P*, , by using a cuspidal C*** in S, we find similarly a collineation 
group, isomorphic with G,,,, which is generated by the permutations of t; , --- , tn 
and the element 


bt ES tee thea) — tee + tea G=1,---,k+1), 
k-1 
k+1 


Ty, --+,e4a? 

ty = t+ (t+ +++ + tes) (j=k+2,---,n). 
In a few particular cases congruence implies projectivity so that the isomorphism 
between the collineation group and G,,, is 1 to 2. It is easy to verify that this 
linear group has a quadratic invariant form 


{n(k — 1) — (K+ 1)*}{(L#)} — & - 19}. 


If this form is definite and negative, the linear group, and therefore also the iso- 
morphic G,x, is finite. The condition for this is n(k — 1) — (k + 1)’ < 0, 
or, in general, n < k + 3, except that for k = 2, 3, 4 this upper limit is in- 
creased by 3, 1, 1, respectively. 

When the Cremona group G,,,x in Lx(n—x-2) is finite, it has an invariant theory 
of type quite similar to that of a finite projective group. When, however, it is 
infinite and discontinuous, the situation is not so simple. The group then may 
or may not have invariants, an invariant being a primal in 2 which is trans- 
formed into itself by all the elements of the group to within extraneous factors 
which arise from the “principal loci” of the particular transformation employed. 
However, the group has, in common with infinite continuous Cremona groups, 
the property of having invariant linear systems of primals. For example, in 
the first infinite cases which arise in S, and S; in connection with the point sets 
P; and P3 , the sets are on a unique elliptic curve which has projective invariants 
S, T and a linear system of invariants S* + kT”. This incidence of point set 
and normal elliptic curve being invariant under regular Cremona transformation 
with F-points at Pj, Ps, respectively, the corresponding infinite Cremona 
groups Gy 2 and Gs.3 in Ly and Ly , respectively, have pencils of invariant primals. 
In these, and in the few analogous simple instances, there exists also a discon- 
tinuous aggregate of invariant primals of increasing orders. Thus for P§ the 
condition that P3 be the set of r-fold points of a curve of order 3r (r = 2) yields 
for each value of r an invariant primal of Gy». Similarly, the conditions that 
Pi) be on a cubic curve, or that Pip be on a quadric surface, yield invariant 
primals of Gi. and Gyw,3 , respectively. 

For larger values of n the conditions on P*, that the points lie on a normal 
elliptic curve E**' in S, yield a linear system with base E of primals in 


Yk(n—k—2) invariant under G,, , the base of this system being the locus of points P 
which map sets P* of this special character. Similarly, for k = 3, the conditions 
on P* that the n-points lie on a normal rational surface R*™ in S, yield a linear 
system with base R of primals invariant under G,,.. It is not difficult to see 
that the base E is contained in the base R though the linear systems of primals 
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defined by E and R# have different orders. There has not thus far been observed 
in S; (k 2 4) types of three- or more-dimensional manifolds which are unaltered 
in type by regular Cremona transformation in S; , so that the question as to the 
existence of other invariant linear systems of G,,, remains open. 

The isomorphism of the group G,, with a linear group has already been 
mentioned. This linear group was represented on n variables, but it is usually 
more convenient to represent it on n + 1 variables which may be identified 
with respectively the order and the multiplicity of a linear system of primals 
in S, at the n points of P,. The group then has also an invariant linear 
form, but, to offset this, it has only integral coefficients. Any group of this 
character has for each value of the integer v an invariant subgroup of finite 
index which consists of those elements which reduce to the identity mod ». 
The finite factor group of this invariant subgroup may be represented by the 
elements of the linear group with coefficients reduced mod v. Thus the infinite 
Cremona groups G,,,, have a sequence of invariant subgroups and a corresponding 
sequence of finite factor groups G,,x(v). In the finite cases these invariant sub- 
groups are usually merely the identity, and the factor groups therefore the 
finite G,x itself. 

In the first infinite G,,, = Gs. these invariant subgroups determined by » 
are characterized by the fact that in 2x,-x-2) there exists a primal M(v) whose 
points are invariant under the operations of the corresponding invariant sub- 
group. Then the G).» effects upon the points of the invariant M(v) only those 
permutations of a group isomorphic with the factor group G»,2(v). This primal 
M(v) is the locus of points P whose corresponding sets P3 have the property of 
being the 9 »-fold points of a curve of order 3v. In further cases these invariant 
subgroups are not so easily defined in terms of P’*, itself. 

Of particular interest in such a connection as has just been described is the 
first prime modulus 2. For, the linear group has a quadratic invariant, and 
the polarized quadratic may be regarded as a null system [since ry2 + 2241 = 
XiY2 — Ley; (mod 2)]. In the theory of the integral linear transformation of the 
2p periods of the theta functions of p variables a similar invariant null system 
appears, and under such transformation the 2°” — 1 proper half periods are 
permuted under the finite linear group with coefficients reduced mod 2 which 
has this invariant null system. Thus it is natural to expect that the factor 
group G,,x(2) is either this theta factor group or a subgroup of it. 

A detailed examination of G, ,(2) shows that this is indeed true. According 
as n, k = 0, 1, 2, 3 (mod 4) we find 16 cases. In two of these cases, G,,,(2) is 
simply isomorphic to the group of the above null-system, say G,(2), a simple 
group. In four other cases, G,,(2) is simply isomorphic to that subgroup, 
also simple, of G,(2) which leaves one of the 2°” odd or even theta functions 
unaltered. In the remaining ten cases G, (2) contains an invariant subgroup, 
usually Abelian, whose order is a power of 2, and the factor group of G,,x(2) 
with respect to this invariant subgroup is again either @,(2) itself, or the sub- 
group of G,(2) which leaves an odd or even theta unaltered. 

With the composition of G,,(2) thus determined there remains to be con- 
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sidered the subgroup (Gn, of Gn. whose factor group is this G, (2). In the 
case (2)Gy9,2 this group is Abelian, of additive character, with parameters of the 
form dy; + --- + Agvs, where A, , --- , As are positive integers and , --- , vg 
general complex numbers. In the next case of Gy,2 the group (2)Gio,2 may be 
described as isomorphic to the group of Cremona transformations which has an 
invariant rational sextic. If ¢ is a parameter on such a sextic, it is therefore 
isomorphic to an infinite discontinuous group with elements of type t’ = 
(at + b)/(ct + d), and this isomorphism is simple if there exist no Cremona 
transformations 7' for which every point of the sextic is fixed. In a recent 
number of the Roma Rendiconti, Pompili gave a construction for such a trans- 
formation 7’, which, however, turns out to be erroneous. 

In developing the theta functions of p complex variables, we customarily 
express the variables in terms of the 2p periods by means of a set of 2p real 
parameters g1,---,9p, 9i,-*+ 5G» so that a value system wu has a period 


characteristic u = > . The 2” half periods, including the zero half period, 


are then given by ’ , the 2p values e¢, e’ being 0 or 1 with a denominator 2. 
2 
The various theta functions are also distinguished by the values of a theta 


characteristic {ih h, h’ also real, in such a way that 


ofed(«+[2]) = oft te} 


where EZ is a properly chosen exponential factor. In particular the 2” odd and 
even theta functions are given by 0{” (u), where n, 7’ are 0 or 1 with de- 
2 


nominator 2. It is in terms of this “characteristic notation” that the majority 
of the theta formulas have been given. 

However, in order to make the transition from the group G,,.(2) attached to 
sets of points P*, to the theta modular group a quite different notation, the 
so-called “basis notation” is practically essential. In this notation we use 
2p + 2 indices 1, 2, --- ,2p + 2 and name the 2” half periods by an even 
number of indices P;2,....2% = P2x+1,...,.2>42, the zero half period being given by 
P = P,,....2»12. On the other hand the 2” odd and even theta functions are 
named by sets of p + 1 + 2k indices as in 


#1 .2,..-,p41—2k(U) = B o41—2k41,---,2p42(U). 


These functions are even or odd with k. All the tactical relations among the 
half periods and functions are now quite elementary. Thus P;,...;,, + Pj,...;.; = 
P3, ...ingjy--+jg, With, as always, like indices canceling. Also 


ee + Pp---tey) = EBB, sing s—anin---ins(U)- 


Two half periods are syzygetic or azygetic according as they have or have not 
an even number of indices in common. A function does not, or does, vanish 
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for a half period according as k is even or odd in the number p + 1 — 2k of 
their non-common indices. 

The transition from the one notation to the other depends on the choice of 
the basis half periods P;,;,. A convenient choice is the following 


} in Px Ps sd 
B eee J Pop42,1 Pops2123 Pepy23,....5 ¢++ Poppi] 
For this choice the function 


(0 0.-- 0) 


du) = "10 0---0f 


(u) 


1,3,5,---,.2p41(U) = 2,4,6,--.,2p42(U). 
For this choice, if the indices are divided into complementary sets a, --- , 
Qps2,01,---,b», we have the following typical relations among the even 
theta squares: 


3 
p=1: > (—1)* 82,0, 82,0, (u) = 0, 


i=1 


4 
P = 2: > (—1)% 83 ,5,5, 92,0462 (u) = 0, 


i=1 
: 2 
p= 3: > (— 1) Bo ,b,5953 Pasbybobs (u) — (—1)°8’8*(u) = 0, 
a= >» ay. 


All of the respectively 3-, 4-, 5-termed relations among the theta squares can 
be obtained from one of these by the repeated use of two operations A and T, 
A being the addition of a half period and 7 being an integral linear trans- 
formation of the periods. 

For p = 4 there are two relations connecting the 12 even theta squares formed 
like the above. These are 





6 6 
I: > (—1)* 82,5, .--b, Ba,b, ---b, (4) - p (—1)*** 83, 83,(u) = 0, 


i=] i=1 
p = 4: . 
II: Zz (—1)** €a,0,---b [92 ,0,---b, Ba, (u) + 8, 830, ---b,(U)] = 0, 
i=1 
where ¢,5,...0, is the sign of the permutation a;b; --- b4 from the natural order 
of these indices. From these for u = 0 we have the modular relations 


6 


III: > (—1)* [85,s,.., — (—1)*8!,] = 0, 


i=1 
6 


IV: z. (—1)** €a,5, «bg Oa by « «by Dag = 0. 
i=1 








of 


of 


on 


or 











CONFIGURATIONS DEFINED BY THETA FUNCTIONS 485 


The last of these enables us to prove the interesting identity connecting the 
squares of the ten even theta functions which define the basis configuration, 
namely, 


10 
V: > (-1)'8793(u) = 0, 
i=1 
whence 
10 : 
VI: > (-1)'8! = 0. 
i=] 


If we multiply I by (—1)* and replace the second sum from V, we get 


6 4 
VII: De (= 1) 5 ,5;..-, 83,5;---4,(u) + DY (—1)83, 83,(u) = 0, 
j=1 


i=1 


6 4 

VIII: D (—1)**** 8! 5,...0, + 2. (—1)"' 8, = 0. 
i=1 i= 

However, VII is merely that identity which arises from V by a transformation T 

which converts the given basis into a new basis. 

Under the modular group M of integral linear transformations of the periods, 
the odd and even thetas undergo the permutations of a finite group M(2) which 
is generated in a very simple way by a conjugate set of involutions. A particu- 
lar generating involution J;,...;,, is attached to a particular half period P;,...:,, . 
This involution leaves P;,...;,, unaltered or transforms it into P,,...i.,3;-.-j2; 
according as P;,...;,, is syzygetic or azygetic with P;,...;,,. This involution 
leaves #j,...;,4:-2, unaltered or transforms it into 9%j,...;,4,~2:é---i2, according 
as this new # has not or has the same parity as the original #. 

The immediate foundation for the isomorphism between the group G,,.(2) 
associated with the Cremona group G,,, and the above modular group M(2) 
is that G,,, and its isomorphic linear group LL, are also generated by a conju- 
gate set of involutions. Included in this set are first the transpositions of two 
variables (tt;) and second Ly,....24:. If nm 2 k + 2, these involutions are in 
the same conjugate set. When L,., is infinite, this conjugate set is also infinite 
but it becomes finite when reduced mod 2. It becomes relatively easy to 
follow the behavior of these involutions under mutual transformation by ob- 
serving that (¢;t;) and Z,...,.4, are defined respectively by the linear forms 
t; — t; and 4; + --- + t4:. In establishing the isomorphism, the transposi- 
tions (¢;) are always identified with the involutions J;; (¢, 7 = 1, ---,n), 
whereas Zy,...,.41 is identified with Jh,...,241,:,...;,, the % --- 7, being such prop- 
erly chosen new indices with n + r = 2p + 2 that the theta involutions and the 
linear involutions combine in the same fashion. The case Gs is a relatively 
simple illustration. We begin with 


(tt;) >t — 4; em (i,j = 1,--- , 8), 


Tyg 32 th + te + ts: Liew, 
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the index 0 being added to make the number of indices of Ji2:. even, whence 
another index 9 is to be expected. On continuing the transformation process, 
we find that 


Lyesse = Lyeslasclieg 2h +--+: + be > Liessss = Tzs00 , 
Ty 2...8 = LyrLaesaseLars : 2t) + te +--+ tbgi Ty. 


It is now easy to verify that Z.; and the transpositions transform these linear 
forms precisely like Ji2 and J;; transform the corresponding half periods. 
This establishes the isomorphism. The theta involutions J are those attached 
to all the half periods for which %(u) vanishes whence Gs,2(2), or its finite iso- 
morphic Gs2 , is isomorphic with that subgroup of M(2) for p = 4 which leaves an 
even theta function 8&(u) unaltered. 

In all of the finite cases of G,,, this abstract isomorphism between G,, (2) 
and M(2) is vitalized by an actual algebraic connection between the set of 
points P,,, and the theta functions concerned. Moreover, the effect of regular 
Cremona transformation upon P* , and the effect of integral linear transforma- 
tion of the periods upon the theta functions, are precisely those which are fore- 
shadowed by the abstract isomorphism. 

However, a necessary condition for this algebraic connection is the existence 
of a system of three term relations among the theta or theta modular functions. 
For, the set P* projectively defines certain pencils of primes with k — 1 of its 
points as a base. Three primes of such a pencil, each on one of the remaining 
points, are linearly related. This relation must therefore appear in the theta 
functions which define the P* . 

For p = 1 and p = 2 these three-term relations are fairly numerous, and in 
the latter case, are interpretable in terms of a Pg and the Weddle surface deter- 
mined by it. In the case p = 3 with indices 1, --- , 8 let 


Pi234(0) = Psezs(0) = Crore = Coors , 
and denote certain products of these C’s as follows: 
CHP = CiaCins (i,j,k = 1,--- , 6), 
CH ™ = CijerC iseC szerC ses - 


Then one notable modular relation takes the form 


3 
(56) (78) 
(R) > + Cy 0. 
In this three-term relation the negative ratio of two terms is a double ratio D. 
If we think of the index 8 as isolated, D is a double ratio in a set P? of the 
four lines from p; to p,, --- , ps. If 8 is not isolated, D is a double ratio in a 
set P, the eight base points of a net of quadrics, of the four planes on either 
the line psps, or the line pryps, to the points p,,---,~,. Another type of 
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three-term relation involves the initial terms of the odd thetas (p = 3) with 
the variables u restricted to the Riemannian case. It yields an irrational equa- 
tion of the quartic envelope with 7 of its 28 nodes at P;. A third type of 
three-term relation for p = 3 arises from a four-term relation by imposing the 
requirement #(u) = 0 on the variables uw, uz, us. Then those of the four- 
term relations which contain 3(u) become three-term relations. The latter 
define a set P; in space and as u varies subject to #(u) = 0 a variable point ps 
is obtained which with P} makes up a Ps which is the nodal set of an 8-nodal 
quartic surface. A fourth type for p = 3 occurs when C = #(0) = 0, and the 
functions are hyperelliptic. Then the three-term relations define a P3 and a 
Weddle 3-way attached to it. Thus even in the relatively simple case of p = 3 
the three-term relations (with the exception of (R) above) are known to exist 
only for special values of the moduli or the arguments. In the case p = 4 
even this one exception seems to disappear. 

The relation (R) has, however, some significance for the case p = 4 which 
appears as follows. If the period cell of the u’s is mapped in 2-1 fashion 
(+u) by means of the theta squares upon a Kummer manifold K, in S14, 
the K, is invariant under an Abelian collineation group g22» whose involutorial 
elements are defined by addition of the half periods, u’ = u + Pi,...:,,. This 
gx» can be represented very simply by transformations with integral coeffi- 
cients and thus is independent of the moduli of the functions. Given then the 
gz» , and one of the 2” singular points of K, which are determined by the half 
periods, say that one, O, determined by u = 0, the K, itself is determined. As 
the 4p(p + 1) moduli change, this singular point configuration runs over a 
modular manifold M of dimension }p(p + 1) in Se_,. For p = 1, and p = 2, 
ip(p + 1) = 2” — 1, and the modular manifold M covers the space of the 
gz». But for p = 3, and given ges in S;, M has the dimension 6, whence not 
every point of S; can be a singular point of a K; with the given gx. If we 
write the relation (R) in the form 

3 
(R,) D+ Vici)? = 0 
i=1 
and express the theta squares in terms of the codrdinates of ges , the rationalized 
form of (R) is the equation of M. 

An involution in g22» , say u’ = u + Pag, has two skew fixed spaces, F; , Fs 
of dimension See; (q = p — 1). Each of these cuts K, in 2” points, these being 
the points on K, defined by quarter periods Q for which 2Q = P.s. The 
elements of g22» which leave an F unaltered effect within it a gox«. Thus the 
quarter period points on a space F of K, behave like the half period points of a 
K, (¢q = p — 1). Now it may be proved that if the half period points on K, 
are defined by the zero values of the theta squares, C7, ---ig+1—9e » then the quarter 
period points of K, in the F defined by Pag are similarly defined by the theta 
products Cj,..-i.4;-2,C i ---ig41-9:8 OF, In the notation just used by 5 ‘ 
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Now Schottky has proved that the condition that the general theta functions 
of genus 4 with 10 moduli be Abelian with 3p — 3 = 9 moduli is 


3 
(56) (78) (90) 
(Re) > + VC$ = 0. 
Comparing this with the above form of (R:), we see that Schottky’s condition 
may be stated as follows: 

The condition that the theta functions of genus 4 be Abelian is that one quarter 
period configuration shall be a half period configuration for functions of genus three. 

For the general functions in four variables no three-term relations have been 
exhibited. For the Abelian functions, however, the set of modular relations 
formed as in (Re) has been utilized by Schottky to define the sets of 9 and 10 
nodes of a symmetroid. In connection then with these sets P3 and Pi and 
their associated sets P} and P{) we have the desired algebraic relation indicated 
by the isomorphisms between G, (2) and the modular group M(2), even though 
the G,,.’s themselves are infinite. 

When the functions are Abelian, another type of three-term relation (p = 4) 
occurs as a special case of a four-term relation because of the further modular 
condition Co = 0. It appears then that, if u;;, is the linear term in the develop- 
ment of the odd Riemannian theta #;;,(u), we will have a linear identity con- 
necting the three root-functions 


V ui V ui,90, V Uigin V uis00, V utigin V ui,90 (i, le, is,j = 2 eee, 8). 
If then we set 
Fas = V Uape V tas V usoo (a, 8 = 1, ---, 8;a # 8), 


it is clear that three Fg with a common index are linearly related and that all 
of the Fs can be linearly expressed in terms of three like Fy. , Fi; , F23. Hence 
the F.s’s can be regarded as the lines joining the points p, , ps of a planar P§. 
As the u’s vary the point z in the plane runs over the locus of the 9-th node of 
sextics with eight given nodes at P. 

The only types of three-term relations which are known for generic p are 
those which exist in connection with the hyperelliptic funetions for general 
values of the arguments. These serve to define the remaining sets P* for 
which G,,x is finite, and the Weddle p-ways associated with them. These sets 
are more specifically P3233 and P3257} . 

In every case in which the set P% appears in algebraic connection with the 
theta functions it is true that the variation in the set P*, obtained by Cremona 
transformation to congruent sets, and the variation in the set obtained by 
integral period transformation, are identical. 
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INVARIANTS 


By HERMANN WEYL 


The theory of invariants came into existence about the middle of the nineteenth 
century somewhat like Minerva: a grown-up virgin, mailed in the shining armor 
of algebra, she sprang forth from Cayley’s Jovian head. Her Athens over which 
she ruled and which she served as a tutelary and beneficent goddess was pro- 
jective geometry. From the beginning she was dedicated to the proposition that 
all projective coédrdinate systems are created equal. Indeed, at that time the 
viewpoint of projective invariance was the one universally accepted in geometry. 
The rise of projective geometry had first been brought about by truly geometric 
stimuli, the study of conic sections, the theory of perspective and by the develop- 
ment of descriptive geometry, and the so-called synthetic direction of Steiner 
and von Staudt has confirmed the fertility of the projective attitude with respect 
to pure geometry. 

However, its gaining such immense preponderance was, if I am not mistaken, 
due to algebraic rather than geometric reasons: namely, to the fact that the 
group of projectivities is expressed by the simplest of all continuous groups, 
the group of all homogeneous linear transformations. Pliicker in the preface 
of his first work (Analytisch-geometrische Entwicklungen, vol. 1, 1828) openly 
espoused the ascendency of algebra, or, as he said, analysis, over geometry. 
So that perhaps one had better speak of geometric algebra than of algebraic 
geometry, namely, of an algebra which, in establishing its theorems and in the 
search for the proofs thereof, uses geometric terms and is guided by geometric 
intuition. The modern evolution, as far as it does not point its needle toward 
topology, has on the whole been marked by a trend of algebraization, notwith- 
standing the undeniable merits of the great school of Italian geometers. 

The dictatorial rule of the projective idea in geometry was first successfully 
broken by the German astronomer and geometer Mébius. One is forced to 
realize that the group of all homogeneous linear transformations is not the only 
one worthy of consideration and capable of serving as the group of automor- 
phisms in a geometric space. Mébius does not yet possess the general idea of a 
group; however, his notion of Verwandtschaft meets the same purpose in each 
special case he considers. The universal group theoretic interpretation was first 


Received April 20, 1939; an invited address before the American Mathematical Society 
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No explicit references to the literature were given in the address; they can readily be 
supplied from the author’s book The Classical Groups, their Invariants and Representations, 
Princeton, 1939. For the general foundations of the theory of invariants compare in 
particular v.d. Waerden, Mathematische Annalen, vol. 113(1936), pp. 14-35. 
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promulgated in plain words by Felix Klein in his famous Erlanger Programm 
1872, which is the classical document of the democratic platform in geometry 
yielding equal rights to each and every imaginable group. The adaptation of 
his standpoint to the study of invariants has been somewhat slow. Before I 
discuss the main problems of the theory of invariants I find it convenient to 
rephrase Klein’s fundamental idea in slightly modernized and hence slightly 
more abstract terms. 

One wishes to associate with the points P of a space numerical (i.e., reproduc- 
ible) symbols x as their codrdinates. In general this is possible in a conceptual 
manner, without pointing out the individual points with my finger, only with 
respect to a frame of reference, e.g., in Euclidean geometry with respect to an 
arbitrarily assumed Cartesian set of axes. Transition from one frame f to 
another equally admissible one is accomplished by means of a one-to-one cor- 
respondence S in the domain of symbols z. One has to deal, therefore, with 4 
kinds of objects: a set of symbols or céordinates x, a group G of transformations 
S of this set into itself, and further points P and frames f. Their connection is 
to be described thus: A point P relative to a frame f determines a coérdinate 
x = (P,f). Any two frames f, f’ determine a transformation S of our group G, 
such that the coérdinate x’ = (P, f’) of an arbitrary point P arises from its 
coérdinate zx = (P, f) by S, x’ = Sx. With two given frames f, f’ the equation 


(1) (Pf) = P,P) 


defines an automorphism P = P’ of the space. If the same S carries f, f’ into 
g, a’ respectively, one has, along with (1), 


(P, 9) = (P’, 9’). 


This shows that the automorphisms of our space form a group isomorphic with 
@; however, the isomorphic correspondence between the two groups depends on 
the frame of reference and is hence determined up to an arbitrary inner auto- 
morphism of the group. If in studying a given group © of transformations 
xz’ = Sz in a domain of symbols z one wishes to make use of a geometric nomen- 
clature, it is quite fitting to invent a point space with its equally admissible 
frames to which the above scheme applies. 

However, in one regard the scheme is still incomplete. Not only are the points 
of the space to be submitted to symbolical representation, but as has been em- 
phasized by Pliicker, other geometric entities also, e.g., the straight lines, may 
serve as spatial elements. Nay, in physics all sorts of physical quantities, 
velocities, forces, field strengths, electronic spins, etc., should be fixed by numeri- 
cal symbols relative to a frame of reference. The law according to which the 
transformation S depends on the transition f — f’ will then be determined by the 
type of the quantity in question, and will differ for points, lines, velocities, spins, 
etc. Only the elements s of the abstract group are tied up with the transitions 
in a manner independent of the type of quantity under consideration. After 
this correction, Klein’s axiomatics looks as follows. (In its description I use the 
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language of physicists: instead of several points, I speak of a point which may 
assume several positions, or rather of a quantity, e.g., the electromagnetic field 
strength, capable of several values.) 

A. The “symbolic” part (dealing with group elements and codérdinates). 

(1) Let there be given a set y of elements called group elements. Each pair s, 
t of group elements shall give rise to a composite element ts. There shall be a 
unit element e satisfying es = se = s and an inverse s ‘for each group element s: 
¢'s=ss =e. (The associative law is not explicitly required.) 

(2) Let there be given a set of elements called coérdinates x and a realization 
%: s — S of the group y by means of one-to-one correspondences x — 2’ = Sz 
within that set. 

B. The “geometric” part (dealing with frames and quantities). 

(1) Any two frames f, f’ determine a group element s, called the transition 
from f to f’. Vice versa, a group element s “carries” a frame f into a uniquely 
determined frame f’ = sf such that the transition (f — f’) = s. The transition 
j — fis the unit element e, the transition f’ — f the inverse element. If s, ¢ are 
the transitions f — f’, f’ — f” respectively, then the composite ts is the transition 
j -_ . 

(2) A quantity q of the type % is capable of different values. Relative to an 
arbitrarily fixed frame f each value of g determines a coérdinate z such that ¢— x 
is a one-to-one mapping of the possible values of q on the set of codrdinates. The 
coérdinate x’ corresponding to the same arbitrary value q in any other frame f’ 
is linked to x by the transformation z’ = Sz associated with the transition 
(f—> f’) = s by the given realization Y. 

(1) refers to the space, (2) to a special quantity therein. 

This sounds fairly general and abstract. As algebraists we are interested 
almost exclusively in the case where the realization of the group is a representation 
s — A(s) by linear transformations A(s) in an n-dimensional vector space and 
where the codrdinate is therefore a k-vector, i.e., any n-tuple of numbers 
(41, -++,2n). By “number” we mean here a number in an arbitrarily given field 
k. With this limitation we repeat once more our definition of a quantity: 

A quantity q of type UX is characterized by a representation of y in k, s > A(s), 
of a certain degree n. Each value of q relative to a frame f determines a k-vector 
(a, --+ , 22) such that under the transition s to another frame f’ the components 
z; of q transform according to A(s). [The representation s — 1 of degree 1 is 
called the identical representation. A quantity of this type is a scalar.] 

For the purposes of differential geometry this set-up is also of basic importance, 
though its does not tell the whole story. Here the procedure consists in asso- 
ciating with each point of the non-homogeneous “‘differential’’ manifold M a 
homogeneous Klein space of fixed type & and in establishing transitions between 
these Klein spaces by moving around in M. For example, in a recent review of 
E. Cartan’s method of répéres mobiles in the Bulletin of the American Mathe- 
matical Society, I was able to show the adequacy of the axiomatic foundation as 
given here for his treatment of manifolds M, that are embedded in a Klein space, 








492 HERMANN WEYL 


by means of differential invariants. But I shall not enter into this subject here, 
my sole concern at present being algebraic invariants. 

I denote by P = P, the “space” of n-dimensional k-vectors (2, --- , 2n). 
A change of the vector basis in P transmutes 9 into an equivalent representation 
%’. WW or the corresponding type of quantities is reducible, provided P has a 
linear subspace P’ invariant under all transformations A (s) of the group & which 
is neither the total P nor contains only the vector 0. By appropriate choice of 
the vector basis one then may split off a part of the components 2; such that these 
transform only among themselves. Decomposition occurs if P can be decomposed 
into two complementary invariant subspaces P; + P.. This means that, relative 
to a suitable vector basis, the components break up into two classes 


(2) 1, °°* ttl Yr, oes > Ym (i+ m = n), 


the members of each transforming among themselves. The corresponding 
quantity consists of the juxtaposition of two quantities x and y which vary 
independently of each other. Thus one may look upon the electromagnetic 
four-potential together with the field strength as one quantity of 4 + 6 = 10 
components; but everybody will agree that this is a very artificial union. Look- 
ing from the other direction one will try and wish to decompose every quantity 
into independent irreducible (‘‘primitive’’) constituents. For most groups, indeed 
for all which will engage our attention here, this is in fact possible. But the 
demonstration by a]gebraic means of the theorem of full reducibility is one of the 
chief goals of the theory. 

Juxtaposition was defined thus: If the variables 2 , --- , x; are subject to the 
substitution A, and y1, --- , ym to the substitution B, then the row (2) undergoes 
the substitution A + B. Another process of great importance is X-multiplica- 
tion: under the conditions just described, the lm products xy, undergo the 
substitution A X B which one calls the Kronecker product. Hence one may add 
and multiply representations %: s — A(s) and 8: s — B(s) of the same group: 


Y+ B: s— A(s) + Bs); YX B: s— A(s) X Bs); 


or what is the same, one may add and X-multiply quantities. In performing the 
second process, the representation spaces P and P’ of 1 and m dimensions over 
which the vectors z and y range, give rise to an /m-dimensional space PP’ which 
contains the vector z = x X y with the components 
Zik = Lyk - 

In studying linear forms in PP’ one often finds it convenient to replace the 
most general vector z with /m independent components 2 by the vector x X y 
with z; and y, as independent variables. This procedure is called the symbolic 
method in the theory of invariants. One of the most important problems for 
quantities is to decompose the product of two primitive quantities (or of two 
irreducible representations) into its irreducible constituents. Special cases will 
soon occupy us. 
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After all these preliminaries I shall finally say what an tnvariant is. I begin 
with the notion of a vector invariant which presupposes that we are given a group 
I of linear transformations A in an n-dimensional vector space P. Suppose we 
are given a form f(z, y, ---), i.e., a homogeneous polynomial of certain degrees 
u, v, --- in the components of each argument vector z, y, --- which vary in P. 
The cogredient transformations 
z’ = Az, y’ = Ay, 
change f into a new form f’ = Af defined by 


f(z’, y’, +++) = f(z, Y, ++). 


fis an (absolute) vector invariant with respect to the group I if f = Af for all 
transformations A in f. A simple generalization of this elementary concept 
will introduce contravariant argument vectors &, n, -- - which undergo the trans- 
formation contragredient to A while the covariant arguments z, y, --- are 
transformed by A. 

To this elementary notion I oppose the general notion of invariants resting upon 
a given abstract group y = {s} and a series of given representations of y, 


A: s — A(s), BS: s— B(s), 


of degrees m, n, --- , respectively. A function g(r, 9, ---) depending on an 
arbitrary quantity r of type YU, another quantity » of type &, - - - will be expressed 
by a certain function F of the numerical vectors 


zm (q, +--+, 2m); y = (Yr, +--+, Yn), 


in terms of a given frame of reference f, and will be expressed by a certain 
function F’ = sF in terms of another frame f’ into which f changes by the 
group element s. If F’ = F for all s, then ¢ is an invariant. If we make use 
of the numerical vectors and the given representations only, invariance may be 


simply stated by the equation 
F(A(s)z, B(s)y, --- ) = F(z, y,.--- ), 


holding for all sin y. Again, we limit ourselves to the case where F is a poly- 
nomial homogeneous in the components of each vector. Another way of ex- 
pressing the same thing would be to say that an invariant is a scalar depending 
on variable quantities of given types A, B, --- . 

One speaks of a relative invariant if F’ = \-F where the multiplier 4 = A(s) 
depends on s only. s — X(s) is then necessarily a representation of degree 1. 
More generally, a covariant is a quantity of a certain type §: s — H(s), depending 
on variable quantities z, y, --- of given types A, B, ---. 

After having fixed the concepts, we can now turn to the fundamental theorems 
concerning invariants. The first main theorem maintains that the invariants for 
a given group y and a given set of its representations W, B, --- have a finite 
integrity basis; i.e., one can pick out a finite number among them in terms of 
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which all these invariants are expressible in an integral rational manner. We 
do not know whether the proposition holds good for any group y and representa- 
tions &, B,---. One has been able to prove it, however, in the most im- 
portant cases, in particular for finite groups y. After one has ascertained a 
finite complete set of basic invariants J,(z, y, --- ), --- , Ja(z, y, --- ), the 
second task is to survey all existing algebraic relations among them. A rela- 
tion is a polynomial R(t,, --- ,&) of h variables ¢,, --- ,t which is turned 
identically into zero by the substitution 
i = Ji(z, y, oo+ ), eee he = Salt, y, --- ). 

The second main theorem states that one can find a finite number of relations of 
which all relations are algebraic consequences. This is merely a special case 
of Hilbert’s universal proposition about the finiteness of an ideal basis for any 
ideal of polynomials. Indeed the relations form an ideal in the ring of all poly- 
nomials of t;,---,&. Thus the second main theorem is settled once for all 
and we shall pay little further attention to it. To be sure, in each single case 
the problem remains actually to ascertain an ideal basis of the relations. 

I give two examples from the elementary domain of vector invariants. Let 
us deal with invariant forms depending on an arbitrary number of vectors 
x, 2, ... in the same space; invariance refers to a given group I of linear 
transformations in that space. If I is the group of all unimodular transforma- 
tions, one gets an integrity basis by forming from the given argument vectors 
in all possible combinations the determinants [zy - - - z] of the components of n 
vectors z, y, ---,2. If contravariant arguments ¢“’, ¢”, --. are admitted, one 
must add the following two types 


(éz) = £121 + as + EnXn 


and [7 --- ¢]. On the other hand, if T is the group of all orthogonal trans- 
formations, then the scalar products 
(1) (2) 
(zy) ‘ = 0 3 es 
of the argument vectors constitute an integrity basis for invariants. Sur- 
prisingly enough the last result holds good even when the underlying number 
field is any field of characteristic zero in the sense of abstract algebra. Since 
the construction of suitable Cartesian coérdinate systems to which the proofs 
resort depends on laying off a given segment on a given line, one would have 
expected the result to be restricted to ““Pythagorean”’ fields. As one knows, a 
field is called real (Artin-Schreier) provided a square sum never vanishes unless 
each term vanishes. I name a real field Pythagorean if the square sum of two 
numbers is always a square. All relations between scalar products are in the 
case of the orthogonal group consequences of the relations of the following type: 
(xx) (r2z’) «+» (xx) 
(x x) (x x’) As (x 


(Second main theorem for orthogonal vector invariants.) 
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By means of his theorem about polynomial ideals Hilbert had reduced the 
general proof of the first main theorem to the construction of a linear operator w 
working on polynomials F(z, y, --- ) and having the following two properties: 


(3) w(1) = 1, w(F-J) = w(F)-J 


whenever J is an invariant. If y is a compact Lie group, one can follow a 
procedure inaugurated by Adolf Hurwitz and define an invariant measure of 
volumes on y by means of which one is able to form the average 2,{y(s)} of 
any continuous function ¥(s) on y. One sees at once that 


w(F) = M,(sF) 


is a process of the desired nature. By this topological method which necessarily 
presupposes the continuum K of all real numbers as reference field, one succeeds 
in proving the first fundamental theorem for any compact Lie group. I mention 
the instance of the real orthogonal group in K. By the same method I. Schur 
succeeded in carrying over from finite to compact Lie groups Frobenius’ theory 
of group representations, in particular, the orthogonality relations for the repre- 
senting matrices and their characters, while the speaker, together with F. Peter, 
established the completeness relation. 

A. Haar freed the definition of the volume measure of the awkward differ- 
entiability conditions imposed by the Lie nature of the group. H. Bohr’s 
theory of almost periodic functions could be interpreted as the simplest example 
of a similar theory for open, non-compact groups, namely, for the group of 
translations of a straight line. With the theory of compact groups and Bohr’s 
example of a non-compact group before his eyes, von Neumann established the 
theory of almost periodic representations, their orthogonality and completeness, 
for any group whatsoever. Hence the first main theorem for invariants is proved 
for each group as long as we restrict ourselves to quantities z, y, --- as argu- 
ments whose types are described by almost periodic representations. 

All this sounds as if we could rest as God did after the sixth day of creation, 
finding that it was very good! But now enters the snake into the paradise. 
Let us once more envisage the classical case of the group L’ of all real uni- 
modular transformations A in n dimensions. Not one of the representations 
with which the classic theory of invariants deals, not even the representation 
A — A, is almost periodic! Thus the “almost periodic” theory fails just in the 
most important and natural cases. Nevertheless it has been possible to make 
the theory of compact groups fruitful for all semi-simple Lie groups by what I 
have called the unitarian trick. For the group L’ it consists in first extending L” 
to embrace all unimodular transformations with complex coefficients and then 
limiting oneself within this wider group L* to the unitary operations. By 
following Lie’s fundamental suggestion and going back to the infinitesimal ele- 
ments of a group, one linearizes and thereby algebraizes all problems concerning 
structure, representations and invariants of a group; and then such reality 
restrictions as the two encountered above, either to real coefficients or to the 
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unitary subgroup, become irrelevant. Hence each of these subgroups can stand 
for the other, and one of them, namely, the unitary subgroup, is compact and 
thus accessible to the integration method. In the linkage between the infini- 
tesimal and the total group a topological element is involved; but I shall not 
dwell here on this subtle point. Anyway I have been able to show that the 
unitarian trick is effective with all semi-simple Lie groups, and thus not only to 
confirm by a combination of the infinitesimal and integral methods the results 
derived in a purely infinitesimal manner by E. Cartan for the irreducible repre- 
sentations of the semi-simple groups, but also to supplement them by the 
theorem of full reducibility and explicit formulas for their characters. At the 
same time the first main theorem for invariants was thus secured for all semi-simple 
groups. 

The problem naturally puts itself: to corroborate by direct and explicit alge- 
braic construction these results first obtained in a transcendental way. If one 
succeeds, one may hope at the same time to remove the bond by which the 
topological approach ties these results to the field K of real numbers and to 
extend them to any field in the abstract algebraic sense, at least to any field of 
characteristic zero. This is a goal at which I have aimed for many years, 
though not at all with the necessary persistence and singleness of purpose. So 
many other mathematical things have diverted my interests, and the whirlwind 
of political events has had a most disturbing effect on my concentration. How- 
ever, younger men came to my aid, above all Richard Brauer, to whom I owe 
the most essential link in the chain of the algebraic theory. At present I have 
come to a certain end, or at least to a certain halting point, from which it seemed 
profitable to look back upon the track so far pursued, and this is what I tried 
to do in my recent book The Classical Groups, their Invariants and Representa- 
tions. The most important simple groups in the field of all complex numbers 
are: the group L(n) of all (non-singular or merely of all unimodular) linear trans- 
formations in n dimensions, the group O(n) of all (or all proper) orthogonal 
transformations in n dimensions, and the group Sp(n) of all linear transforma- 
tions in n = 2v dimensions leaving invariant a non-degenerate skew-symmetric 
bilinear form. The last I have christened the symplectic group. These are 
even the only ones, apart from 5 quite singular exceptional groups. I shall 
deal exclusively with these groups L(n), O(n), Sp(n). For their investigation 
a finite group, the group of all permutations, must be drawn in, and one could 
also include the alternating group of permutations. These groups are in my 
mind when I speak of classical groups. We are first engaged in algebraically 
constructing the possible types of quantities under their reign. 

Again we start with the universal linear group L(n), an arbitrary element of 
which we denote by A: 


a; = > a(tk)x (i,k =1,---,n). 


You all know what a tensor of rank r is. It has n’ components t(2; 22 - - - #) 
labeled by r indices i; , iz, --- , 7, ranging from 1 to n; under the influence of 
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the transformation A of the coérdinates in the underlying vector space these 
components are transformed according to the substitution 


1(A)=AXAX-:---XA (r factors) 
or more explicitly 
(4) t’ (a; rer ir) = P > a(i; ky) — a(i,k,) -t(ka a kr). 


The generic tensor of rank r is the quantity arising by r-fold X-multiplication 
of the quantity vector. But the space P’ of all tensors is not irreducible under 
the group II,(Z) consisting of the substitutions II,(A) which are induced in 
tensor space by the elements A of L(n), whereas the words symmetric tensor, 
skew-symmetric tensor, indicate irreducible quantities. The tensor space P’ 
must therefore be split into irreducible invariant parts by imposing symmetry 
conditions upon the tensors. The possibility of doing so is based on the fact 
that one can perform an arbitrary permutation p on the r indices or arguments 
ii, --+ , t, Whereby t changes into another tensor pt. In this way enters the 
group x, of permutations p of r figures 1, ---,7. Associating the transition 
t — pt with p defines a representation of x, by linear transformations in P’. 
But why is it that these permutation operators are of importance for the decom- 
position of tensor space into invariant subspaces? One understands this if one 
replaces the group II,(L) of the substitutions (4) to which the tensor space is 
submitted by its enveloping algebra, containing all those substitutions which 
can be gained by linearly combining any finite number of substitutions of the 
group II,(Z). It is easily seen that the enveloping algebra consists of all linear 
substitutions t — Ht commuting with the permutations p: t — pt. The group 
r, of permutations may also conveniently be replaced by the enveloping al- 
gebra, i.e., by the corresponding group ring whose elements 


> a(p)-p [a(p) numbers] 


may be interpreted as “symmetry operators’ working on tensors. 

The general situation under which our problem is naturally to be subsumed is 
now this: Instead of the tensor space we consider an arbitrary vector space P 
whose vectors are called t; there is given a finite group y = {p} and a representa- 
tion of y in P representing the abstract group element p by a linear substitution 
t— pt. Weare interested in the algebra & of linear operators t— Ht commuting 
with all operators t — pt of y. The regular representation r of a finite group y 
or of its group ring (y) = p has p itself as its representation space, representing 
any element a of p by the transformation zx — az of p into itself. By a well- 
known theorem due to Maschke the regular representation of y is fully reducible; 
this holds good in any field, unless it is of a prime characteristic dividing the 
order of y. We take into account only fields of characteristic zero. A thor- 
oughly elementary method permits establishment of a complete parallelism 
between the subspaces of p invariant under r on the one side and the subspaces 
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of P invariant under the algebra 2% on the other side. The parallelism is faithful 
with respect to addition and the relation of being contained for invariant sub- 
spaces, and also with respect to equivalence under their respective operator 
algebras r and Y. 

For the symmetric group 7, one knows how to carry out the decomposition 
into irreducible invariant subspaces by means of the symmetry operators which 
were invented by A. Young and later, under the leadership of E. Wigner, have 
found such surprising applications in quantum mechanics. Let us attach the 
word quantics, originally coined by Cayley, to the quantities which one prepares 
in this way from the material of tensors under the rule of the full linear group 
L(n). The domain of quantics is closed with respect to the two most important 
operations: (1) X-multiplication of two quantics followed by decomposition into 
irreducible constituents, (2) transition from a representation to its contra- 
gredient. Each Young operator and hence each quantic’ is characterized by a 
partition of the rank number r into n integral summands 


r=ntrt--- +m (1 2m2--- 2m 20). 


We represent this partition by a symmetry diagram whose rows have the lengths 
T1,1,°++,?.. Example: 


m=7 OOO0O000O 
rz = 5 OOOOO 
tr7=5 OOOOO 
™m™=2 00 

tr = 1 O 


If one wishes to employ a similar method for the orthogonal and the symplectic 
groups one has first to get hold by a simple description of the enveloping algebra 
of the substitutions II,(A) induced in tensor space by the elements A of these 
more limited groups O(n) and Sp(n). The problem is not as trivial by far as 
in the former case of the full linear group L(n), and R. Brauer succeeded in 
solving it only by resorting to the general theory of matric algebras. If one is 
given an algebra % of linear substitutions or matrices A in a certain vector 
space P, then the matrices B commuting with all matrices A of the set % form 
in their turn an algebra 8 which I call the commutator algebra of A. The key 
principle asserts that if %& is fully reducible, the commutator algebra of the 
commutator algebra of % is not larger than % as one might expect, but coincides 
with &. This principle holds in any field. It is the crowning result of a theory 
of matric algebras based on this fundamental advice due to I. Schur: along with 
a given matric algebra, always consider its commutator algebra. Unable to 
refer to any other place, I had to incorporate in my book this theory which has 
become a central issue in the whole non-commutative algebra. Perhaps many 
a reader will find such a concrete treatment in terms of matrices more easily 
accessible than the abstract handling of semi-simple rings. 


1 We disregard here a slight modification necessary to work the process (2). 





oo. mm =.=: «06a 
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If one replaces the group II,(O) induced by the orthogonal group O(n) in 
tensor space by its enveloping algebra thus determined, one succeeds again in 
decomposing the most general tensor into quantics with respect to the orthogonal 
group. The primitive quantics to which one finds oneself reduced differ from 
those for the full linear group in two essential regards: (1) The Young symmetry 
operator is applied not to an arbitrary tensor, but to the most general tensor 
whose 3r(r — 1) traces vanish; the (12)-trace of a tensor t(i; 72 73 -- - 7,) being 
given by 

tio(is--- ie) = Di t(iits --- i) 


(process of Verjiingung). (2) While all symmetry diagrams whose first column 
had a length < n were admitted for the full linear group, only those occur here 
whose first two columns have a total length S n. Similar results obtain for 
the symplectic group which in many regards is easier to handle than the or- 
thogonal group. 

The algebraic concept of invariants which we adopt for the classical groups 
is that of a scalar depending on one or more independent variable quantics. 
Arbitrary forms such as occurred as arguments in the classical theory of in- 
variants are identical with arbitrary symmetric tensors, and under the reign 
of L(n) these are quantics which correspond to a symmetry diagram of one row 
(the length of the row being the order of the form). Let us first stick again to 
L(n)! In all textbooks on our subject one is told a lot about the so-called 
symbolic method which reduces form invariants to vector invariants. On the basis 
of the above analysis one shows readily that the method still works for invariants 
of variable quantics of any type. However, since the number of argument 
vectors to be introduced depends on the degree of the invariant under considera- 
tion with respect to the variable quantics, the reduction to vector invariants is 
by no means sufficient without further resources for a proof of the first main 
theorem. Certainly the importance of the symbolic method whose formal 
elegance nobody will deny has been greatly exaggerated. I consider it one of 
the more glaring examples of the power of tradition and inertia in mathematics 
that the elementary textbooks on invariants up to this day deal almost exclu- 
sively with this method and its applications. Frequently quite different ap- 
proaches, e.g., irrational methods, lead much faster to the goal. Hilbert’s 
general proof dealing with form invariants of the group L(n) is based on his 
general proposition about a finite ideal basis for polynomial ideals and employs 
as the above mentioned process w with the properties (3) Cayley’s purely alge- 
braic 2-process. The method goes through in any field of characteristic zero 
even if quantics instead of forms are the arguments. 

The projective geometers were able to cope with affine and metric geometry 
by adjoining some entities given once for all which they called the absolute: 
the plane at infinity and the absolute involution. In the same line of ideas 
relativity theory has found it convenient to treat the metric vector space as 
an affine vector space in which a positive quadratic form is appointed as metric 
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ground form. Is this standpoint justified as far as invariants are concerned? 
Let variable forms u, v, --- be our arguments in the invariants J(u, », --- ) 
which we envisage. Then the question means whether any orthogonal invariant 
J(u, v, «+ ) arises by the Cartesian specialization g;, = 6; from a L-invariant 
J (gix ; u, v, «-+ ) involving besides the arguments uw, v, - -- a variable covariant 
quadratic form 


(5) » Giz EE. 


That it can answer this question in the affirmative goes to the credit of the 
symbolic method and is its highest triumph. Indeed, each orthogonal vector 
invariant is expressible in terms of the scalar products (ry) of the vectorial 
arguments, and if we replace the scalar product 


(6) DY +--+ + UnYn 


by the L-invariant formation 


which depends on (5) besides the two vectors z and y and changes into (6) by 
the specialization gi, = 6% , then we attain our ends, first for vector invariants 
and then, owing to the symbolic treatment, for form invariants. The procedure 
remains applicable even for arbitrary quantics, although the quantics for the 
group L split into more primitive quantics under the orthogonal subgroup 0. 
In this purely algebraic way based on the adjunction argument we master the 
orthogonal and the symplectic invariants. This procedure has even stood the 
test in certain special cases where the statement of full reducibility breaks down. 

In these days the angel of topology and the devil of abstract algebra fight 
for the soul of each individual mathematical domain. As to the decompositions 
into invariant subspaces whose algebraic construction has here been indicated, 
one would like to know in some explicit way with which multiplicity each of the 
inequivalent irreducible constituents occurs. This question is answered most 


readily if one replaces the representations by their characters. Explicit formulas ' 


for the characters are much more easily obtained by the integral-topological 
approach. The identification of the characters thus derived with the alge- 
braically constructed representations to which they correspond causes a little 
headache; however, if this has been remedied one has seized upon a result which 
by its very nature is independent of the nature of the reference field, and that in 
spite of the fact that the topological method operates in the field K of all real 
numbers. Thus we are face to face with a peculiar application of analysis to 
purely algebraic problems whose stage is set in an arbitrary field. The multi- 
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plicities just mentioned yield at the same time formulas for the explicit enumera- 
tion of invariants and covariants. This field has recently been tilled with high 
success by Professor Murnaghan. 

I must forego giving examples of such enumerating formulas. Instead I 
prefer to mention a by-product of the algebraic investigation. In the n*-dimen- 
sional space of all matrices A = || a, || the equations 


(7) >> Anan, — 5; = 0, det (an) —1=0 
k 


define a certain algebraic manifold, the proper orthogonal group O*. This 
manifold is irreducible, or, what is the same, the ideal of all polynomials (a,x) 
vanishing on O*(n) is a prime ideal. The polynomials constituting the left 
members of the equations (7) form an ideal basis for our ideal, and Cayley’s 
rational parametrization of orthogonal substitutions A, 


A = (E — S)(E + 8)", 


in terms of an arbitrary skew-symmetric matrix S yields a generic zero (all- 
gemeine Nullstelle) of the ideal, provided the elements sx (¢ < k) of the skew 
matrix S = || sj || are treated as indeterminates. All this holds good in any 
field of characteristic zero. 

I hope my sketch has shown how closely the investigation of the invariants 
of a group is tied up with the ascertainment of its representations. This 
connection with the general theory of representations and of matric algebras 
has carried new life-blood into the older theory of invariants which thus has 
joined the modern forward movement of algebra and now participates in its 
general conceptual structure. I feel bound to add a personal confession. In 
my youth I was almost exclusively active in the field of analysis; the differential 
equations and expansions of mathematical physics were the mathematical things 
with which I was on the most intimate footing. I have never succeeded in 
completely assimilating the abstract algebraic way of reasoning, and constantly 
feel the necessity of translating each step into a more concrete analytic form. 
But for that reason I am perhaps fitter to act as intermediary between old and 
new than the younger generation which is swayed by the abstract axiomatic 
approach, both in topology and algebra. 

In closing I should like to point out a few lines of probable further advance. 
First, one naturally wishes to do all things also in a field of prime characteristic. 
Secondly, it is desirable to find all inequivalent irreducible representations in an 
arbitrarily given field; it is doubtful enough that they are exhausted by the 
quantics, though these form a class of quantities algebraically closed in a certain 
sense. If one replaces a continuous group by its infinitesimal elements, one has 
to deal with a Lie algebra and one will ask for its representations and invariants. 
The classical groups together with the 5 exceptional groups mentioned above 
yield all the simple Lie-algebras in the field of complex numbers, or in any 
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algebraically closed field. However, this does not remain true in an arbitrary 
field. In this question Landherr, A. A. Albert, Jacobson and Zassenhaus have 
recently made much headway. So Iam confident that in a few years a younger 
algebraist will be able to write a similar book dealing comprehensively with the 
representations and invariants of all semi-simple Lie algebras in an arbitrary 
field. 


INSTITUTE FOR ADVANCED StTupy. 











ADDITIVITY AND CONTINUITY OF PERSPECTIVITY 


By Isrart HALPERIN 


Introduction. If L is the set of linear subspaces of a projective geometry and 
D(a) is the common dimension function’ defined for the elements of L, it is 


easily shown that 
(1) D(a + b) + D(ab) = D(a) + D(b) for all a, b in L. 


But if D(a) is defined’ to be the common dimension of a plus 1, then D(a) 
satisfies not only (1) but also’ D(0) = 0 and hence 


(2) D(a + b) = D(a) + D(b) if ab = 0. 
More generally, it will follow that for every finite m 

(3) D(d, a,) = > D(a,) 

if a, --- , @m are linearly independent. Since in a given projective geometry 


there is a finite upper bound for the number of elements which can all be different 
from 0 and be linearly independent, there is no point in considering (3) for 
non-finite sums. 

J. von Neumann has given a remarkable set of axioms defining a class of 
geometries which includes all projective geometries as well as a new type of 
geometry which he has named continuous geometry.’ He has shown, too, that 
in each of these geometries there exist a dimension function D(a) and a concept 
of independence such that (3) holds for all sets of independent elements whether 
their number is finite or not. And in continuous geometries there do exist 
countable sets of elements which are all different from 0 and are independent, 
but there are no such non-countable sets. In all of these geometries two ele- 
ments a, b have the same dimension if and only if a can be mapped into b by a 
perspectivity (denoted by the symbols a ~ b).° 

Among the axioms given by von Neumann to define projective and con- 


Received September 28, 1938; in revised form, March 23, 1939. 

1 That is, dimension of a point = 0, dimension of a line = 1, etc. The subspace of L 
which does not contain any points has common dimension = —1. 

* It would be even more suggestive, from the point of continuous geometries, to define 
D(a) to be [(common dimension of a) + 1]/[(common dimension of L) + 1]. 

’ The same symbol will be used to denote both the number 0 and the empty subset of L, 
but there should be no confusion. 

‘See [3], [4], [5]. (Numbers in brackets refer to the bibliography at the end of the 
paper.) 

5 The actual construction of the dimension function is closely related to this property. 
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tinuous geometries there is an axiom of irreducibility.° This axiom of irre- 
ducibility is essential for the construction of the dimension function, but it is not 
needed in the definition of perspectivity nor in the definition of independence, 
This makes it possible to formulate (3) in a way that has meaning in all reducible 
geometries, that is, in all systems which satisfy all of von Neumann’s axioms 
with the possible exception of the irreducibility axiom. The new formulation 
states that for every set I of indices 


(4) If a. , ae J, are independent and b, , a e J, are independent, and if aa ~ b, 
for every a e J, then p+ da ™ y ba ; 

ael ael 
that is, that perspectivity is unrestrictedly additive in reducible geometries. 
(4) is significant for all sets J of indices since in reducible geometries there can 
exist sets of elements of any given cardinal power which are all different from 0 
and are independent. 

The present paper will establish this unrestricted additivity of perspectivity 
as well as the continuity of perspectivity (described below). These properties 
of perspectivity have been established previously by the writer for countable 
sets of elements,’ and the methods used will be simplified here considerably and 
extended to give the general theorems. In §1 the axioms and definitions to be 
used will be stated precisely. In §2 the theory of independence and other 
preliminary theory of von Neumann will be recalled. The continuity and 
additivity of perspectivity will be established, as stated, in §3 and the axioms 
on which the proofs depend will be somewhat weakened in §4. 


1. Axioms and definitions.” 

DEFINITION 1.1. A reducible geometry is a system L of elements a, b, ¢, --- 
with a relation a S b (equivalently, b = a) defined for certain pairs of elements 
of L and satisfying the following axioms. 


Axiom I. PARTIAL ORDERING. 
I,, a3 b,b Sc together imply a S c, and 
I, a < b,b S aare together equivalent toa = b. 


Axiom II. CompLetengess. For every subset of elements a., a € 1, there exists 
II,;. a sum element ZZ. aq such that for every ain L,a = b ™ aq if and only if 
ael ael 
a = a, for everyael. 
II,. an intersection element II Ga such that for everyainL,a Ss II a. if 


ael ael 


and only if a S aq for everyael. 


6 See [3], Axiom VI. 

7 See [1], §6. 

8 The axioms and definitions of this section will be found in slightly different but equiva- 
lent form in [3]. 
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Axiom III. Continuity. For every ordinalQ and for every set of elements aq, 
a < Q, and for every b, 


I). b Dae = > (b Dd az). 
a<2 ¥<2 asy 
lk. b+ I] a, = IT (6+ I] a,). 
¥< 


a<Q asy 


Axiom IV. Moputarity. For alla, b,c in L, (a + b)e = {a + (a + c)b}e, 
or what is equivalent, a S c implies (a + b)c = a + be for all b. 


Axiom V. CoMPLEMENTATION. For any a, b, c in L such thata Sb Sc, 
there exists at least one element, which may be denoted by [ec — b]., such that 
b+ [c — bl, = cand bic — b), = a. 


Remark 1. This set of axioms is self-dual, that is, it is unchanged if S$, >>, 
[] are everywhere replaced by =, [], 0, respectively. Hence any theorem 
which holds for <, >>, [] will hold also for the dual theorem which arises from 


such replacements. 
Remark 2. If aq ranges over all the elements of L, the elements 0 = II a. 


and 1 = ze a, will have the property that, foreveryain L,O SaZ1. We 


shall frequently write [c — b] in place of [ce — b}o. 
DEFINITION 1.2. A set of elements a., ae€J, is said to be independent, 
written (a,, aeJ)1, if aa( > a3) = OforeveryaeJI. We shall frequently 
Bel pxa 
write = @ a, in place of ps a, to denote that the a, , a «J, are independent. 


ael ael 
DEFINITION 1.3. Two elements a, b are said to be perspective, written 
a ~ b, if there exists any element c (called an axis) such thata +c = b+ ec 
and ac = be. If a ~ b; for some bh; S b, we write a < b. 


DeFIniTIon 1.4. If a, is defined for all a < some limit ordinal 2, and 
II Zz ag = p > Il a3 = a, we write lim a, = a. 
Q 


a<Q Bra a<Q Bea a- 
Remark. If aq S ag for alla S B < Q, lim a, exists and is equal to yy é.. 
a<Q 


si a—Q 


Ifa, = ag for alla S 8B < Q, lim a, exists and = II On. 
a—Q a<Q 


2. Preliminary theory.’ 
THEOREM 2.1. If (aa, a€I)1 and ba S ag for all a eI, then (ba, aeTD)1L. 


* The following theory of independence and perspectivity will be found in [4]. Our 
definition of ~ is equivalent to the definition used in [4] by the Theorems 2.3 and 2.2 of [4], 
Part I. Our Theorems 2.2, 2.4, 2.6, 2.8, 2.10, 2.11, 2.12, 2.13 are identical with, or follow 
immediately from, the Theorems 2.3, 2.6, 2.4, 3.6, 3.3, 3.4, 3.8, 4.4, respectively, of [4], 
Part I. Our Theorem 2.7 is implied by the proof of Lemma 3.3 of [4], Part III, and is given 
explicitly as Lemma 3.3 in [2]. The transitivity of perspectivity, the first part of our 
Theorem 2.14, is the Theorem 2.3 of [4], Part III, and is proved with the use of only the 
“countable” axioms in [1]; the rest of our theorem follows as shown in Theorem 2.5 of 
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THEOREM 2.2. (a.,aeJ)1 if and only if (aa, a¢J)1 for every finite subset 
J of I. 


THEOREM 2.3. For every limit ordinal Q, (aa, a < Q)L if and only tf (a;, 
B <a) foralla < Q. 


TuHeoreM 2.4. If (aa,aelI)1,if Js, 8 €P, are a family of subsets of I and J 


is the set of indices common to all Js, B ¢ P, then JT] (D> ae) = > ae. 
BeP aeJdg aeJ 


THEOREM 2.5. If (a, aeI)1 and Js, Be P, are mutually exclusive subsets 
of I, and if bs = >. aa, then (bs, BeP)1L. 


aeJg 


> Gag and if 


Bela 


TuHeoreM 2.6. If (das, BeIa)l for every aeP, if aa 
(ag, ae P)1, then (dag, Bela, aeP)L. 


TuroreM 2.7. If for some ordinal Q decompositions dg = dai © Ga are 
defined for all1 S a < Q, and if a, = II a. for all limit ordinals y < Q, then 


= ’ a<y 
m= %@ > @a,. 


a<Q 


THEeoreM 2.8. If (a2,b3,a < 2,8 < Q)L, andif a, ~ ba foralla <Q, 
then > @ a4 ~ > @ ba. 
Q 


a<Q a< 


TuHeoreM 2.9. Ifa ~bandc 2 a + b, there exists an element x such that 
a@z=bO2r= Cc. 


TuHeoreM 2.10. If a ~ b and q S&S a, thena, = b. 
THEOREM 2.11. a ~ 6b, b ~ cand (a, b,c) _L imply thata ~ c. 


TueoreM 2.12. If (a,,n = 1,2, --- )L anda, ~ Gn: for all n, then a, = 0 
for all n. 


THEOREM 2.13. a « b,b « aare together equivalent to a ~ b. 

TuHeoreM 2.14. Ifa~bandb~c,thna~c. Ifa « bandb « ¢, then 
aac. Thedualtoa « bis equivalent tob « a. 

3. Continuity and additivity of perspectivity. 

TuHeoreM 3.1. If ab = 0, then (a + c)b « c for all cin L. 


Proof. Replacing a by [a — ac], we may assume that ac = 0. Then 
(a + c)b ~ (a + b)c with axis a since 


a(a + c)b = ab = 0, a(a + b)c = ac = 0, 
[4], Part III. Finally, our Theorems 2.1, 2.5 follow easily from the definition of 1, our 


Theorem 2.3 follows easily from our Theorem 2.2, and our Theorem 2.9 follows from the 
Theorem 3.9 and the definition of ~ of [4], Part I. 
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and, using the modular axiom, 
a+ (a+c)b = (a+ b)(a+c) =a+ (a+ Dddec. 
Since (a + b)c S c, we have (a + c)b & c as required. 
THEOREM 3.2." Ifa @®b«c@dandc « a, thenb « d. 


Proof. Since perspectivity is transitive (Theorem 2.14), we may assume 
thta+bxc+d. Replacing aby [(c + d) — (a+ b)] + a, we may assume 
thta ®b=c@d. If nowc ~ aq S a, we can replace a by a; and b by 
b + [a — a,]. Thus we may assume that a ~ c, and hence by Theorem 2.9, 
a@®@xr=c@x=a@b=c @d for some zx Then b ~ 2,2 ~d, and 
Theorem 2.14 implies that b ~ d. The theorem follows from this. 


THEeorEM 3.3.° Ifa « c,b « d, and ab = cd = 0, thna@®b«ec@d. 
In particular, ifa ~ candb ~ d, thna®@®b~ce@d. 

Proof. Lete=[(a+b+c+d) — (a+ 5d)],f = [(a+6+c¢+d) —(c+d)]. 
Thene@a@®b=f@ece@d. Since b « d, Theorem 3.2 implies that 
f@®c«e@a. Since a « c, Theorem 3.2 implies that f <« e. Frome ® 
(a ®@ b) = f @ (ce @ a) we finally derive by Theorem 3.2 thata © b « c¢ @ d, 
as required. 

If a ~ c and b ~ d, we can derive that a @ b « c @ d, and in the same 
way,c ®d «a@b. From Theorem 2.13 it then follows thata @b~c @d. 


THEOREM 3.4. Ifa «b+ c, thena = a, @ a2 witha, « band a; « c. 


Proof. Since perspectivity is transitive, we may suppose that a S b + c. 
Set a, = ab and az = [a — a]. Then a = a; © a2. Now a S&S b implies 
a, « b, and agb = 0, ag S b + c imply, by Theorem 3.1, ag « c. This proves 
the theorem. 


TueoreM 3.5. Ifa « b,andb. = bsforalll Sa SB < Q, then I] >. =0 
a<Q 
implies that a = 0. 
Proof. 1. If 2 has a predecessor A, then bk = II b. = O anda « by, and 


a<Q 
this implies that a = 0, as required. Hence we may assume that @ is a limit 
ordinal. Replacing b, by b. for all limit ordinals y S 2, we may also 


a<y 
suppose that b, = II }. for all limit ordinals y < @. 
a<y 
2. Since perspectivity is transitive, we may assume that a S b:. Now 
define dg = dba, @qg = [da — Gay]. Then aa = Gay: © a’, for alla < 2, and 
a, = ab, = a(]] db a II (ab.) = II ae for all limit ordinalsy < @. This 
a<y a<y a<y 


implies by Theorem 2.7 (use the fact that ag S be = 0) that a = a, = = @ a. 
a<@ 


1° Theorems 3.2 and 3.3 are the Theorem 2.4 of [4], Part III, slightly strengthened. 
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3. Now suppose that for some w < @ the following statements hold for all 
A <w: 

(1), ¢4 is defined and S$ <b, and c, ~ a, for all a < X, and 

(2), (ag, a <Q, cp, B <A)L. 
If w is a limit ordinal, it is clear that (1), and, by Theorem 2.3, (2), also hold. 
Suppose now that w has a predecessor, sayw = y + 1. Define 


d=(b.4.-{> ea. 0(L @ a. ®@ L © ca)brss}]. 


a>y 


A suitable application of Theorem 2.7 shows that 


(2 da a)bys1 = ’ Ce Ga a) Qy41Dy 44 = 0; 


easy 
hence, by Theorem 3.1, (>. a, + he Ca) O41 « Zz Ce. By Theorem 2.8, 
asy a<y a<y 


/ , — * 
p> ®@ ta ~ > @®a,. Hence, from the transitivity of perspectivity and 
a<y a<y 


Theorem 3.3, 
(S @a.0 d Ocr)bu® D> @a« } Oa, DY Oauy. 


assy a<y a>v¥y a<y a>yY 


- ’ ’ ’ _ ‘ ° » ’ 

Since >> @ a, @ > 6a, 0 a, =a « b,,,, Theorem 3.2 implies that a, « d. 
acy a>? F P P P 

Hence we can define c, with c, & d S by41 Sb, anda, ~ c,. Thus (1), 


will hold. Furthermore, 


(2 aa + Di ca) = cydbyu( 2) ae + Di ce) 


a< a<y a<y 


= c,d| La + (2 as + Lo c2a)by1} = 0, 


a>y a<y 


implying (use (2), and Theorem 2.6) that (a,,a<2,c3,8<w)L. Thus (2), 
also holds. Thus, by induction on w, we can define c, for all a < @ so that 
(1)e and (2)¢ hold. 


If we now setc = >. @c,, wehavea @c S b;, hencea @c « by, and also, 
a<Q 


by Theorem 2.8, a ~ c. Thus, under the hypotheses of Theorem 3.5, there 
exists an element c such thata ~ c anda @®c « by. 

4. If, for any fixed a, we replace all bs, 8 < a, by b. in the preceding paragraphs, 
it follows that there exists an element c, such that a ~ c, anda @ c, « ba. 
From the transitivity of perspectivity and Theorem 3.3 we derive ¢ ~ Ca, 
a@c~wa@ec,, and hencea @c « b, foralla < Q. 

5. If we now apply the reasoning of the preceding paragraphs to a @ c in 
place of a and iterate this process, we obtain an independent infinite sequence 
of elements, each perspective to a. By Theorem 2.12 this implies that a = 0, 
and the theorem is proved. 
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THEOREM 3.6. Ifa « b,andb, 2 bs foralll Sa S8<Q,thena« II bz. 
a<Q 
Proof. 1. As in the proof of Theorem 3.5 we may suppose that 0 is a limit 
ordinal and that by = |] 6. for all limit ordinals y < Q. Define b, = 
a<y 


[ba — basi]. Then, by Theorem 2.7, b. = pm ®@ b; @ be for alla S Q. 


a<p<a 
2. Set c = abe and define a; = [a — cl], bg = [bo — c]. Then (a, bo, c)L. 
Now suppose that for some w < Q the following statements hold for all A < w: 
(3), ax, rir, bd, bY’, ax, BY are defined and a, ~ by. 
(4), @ = Qui @ a,, by = bat’ @ by. 
(5, a = I] a, bi = I] bo if \ is a limit ordinal. 


a<aA acr 


(6), yy. & p bi. 


A<a<Q 
Then, as we shall show, a. , dus1 , 63 , b8*", ay, , b&’ can be defined in such a way 
that (3).—(6). also hold. 
If w has a predecessor, say w = y + 1, then a, , bg are defined by (3), ; if 
is a limit ordinal, define a, = I] G., 09 = Il be. It is clear that (5), will 


a<w a<w 
hold. Whether w is a limit ordinal or not, we have by Theorem 2.7, 
, , w c 
a=aq@c=a,@ 2. Qa Be & bey = >» ba ® ba ©  ® ba’ @c. 

a<w a<a<Q a<@ 
= , , P 1 —— = 
Since (c, a;, bo) 1 and a, S a and bg’ S bg for all a, it follows by Theorems 

» , B? ‘ 

2.1 and 2.6 that (c, ag, a < w, bo, 8B < w)L. Hence by (3)2, a < w, and 
Theorems 2.8 and 3.3 


> @a,.@cr~ > @ bi’ @c. 


a<w a<w 
Then by Theorem 3.2, 


ay & Zz b. @ bg. 


ao<a<Q 
rp P . , e 
By Theorem 3.4 we can define a,,; , @, , 69’ such that 


Qy = Agi © a,,, Aor © =< b., a, ~ bf’ < bs. 
w<a<Q 
If we define bg*' = [bg — bg’], it follows that all (3),—(6), will be satisfied. 
By induction on w, we can suppose that (3).—-(6). hold for alla <2. Then 
we can define ag, bg to satisfy (5)e . 
3. From (6)a, @o9 S das « DD, by for all a < Q; by Theorem 2.4, 


say asp<2 


II ( } bs) = O since (b3,8 <2)1. Theorem 3.5 now shows that ag = 0. 


a<Q asp<tl 
By Theorems 2.1 and 2.6, (c, a, , a < Q, bf, 8 < Q)L. Finally, from Theo- 
rems 2.7, 2.8, 3.3, 

a= > @a,O@c~ LD Obi’ ©@c <n = II bu. 


a<Q a< a<Q 
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Thusa « |] b,, as required. 


a<2 


THEeoreM 3.7. Ifb. « aandb,. S bsforalll Sa SB <Q, then ts ba & a. 


a<Q 
Proof. By Theorem 2.14 this is equivalent to the dual theorem to Theo- 
rem 3.6. 
THEOREM 3.8 (Continuity of perspectivity). Jf fim a, = aand lim b, = b 
and a, ~ ba for alla < Q, thena ~ b. a 
Proof. If 8B: , Be < 2, and 8 = max (8;, 82), then 


II ae S a3 ~ bs S D da. 


api a2>Be 
Hence, by Theorem 3.6, il Qa = II > db b. = b. Then, by Theorem 3.7, 
Be<Q a2Be 
= > J] a, «bd, th: tho « 6 Similarly, b < a. By Theorem 2.13 this 


B,<2 ads, 
implies a ~ b, and the theorem is proved. 


THEOREM 3.9 (Additivity of perspectivity). Jf, for any ordinal Q, (ag, 
a <Q)1 and (b.,a <2)1L anda, ~ bafor alla <Q, then p> ®@a.~ >, Ody. 


a<Q 
Proof. Let Ag = > a;,B.= } bsforalla SQ. Itisclear that A. S A; 
B<a B<a 
and B, S B, for alla S B S Qand that A, = lim A,, B, = lim B, for all 
ay a-y 
limit ordinals y S 2. Suppose now that we have already established A, ~ B, 
for alla < w for somew £2. Then if w is a limit ordinal, we have also, by 
Theorem 3.8, that A, ~ B,. If, however, w has a predecessor A, then A, ~ B,, 
a, ~ b, together imply, by Theorem 3.3, that A, © a ~ By @ by, that is, 
that A, ~ B,. Since A; = B, = 0, hence A; ~ B,, it follows by induction 


on w that Ag ~ Bg, that is, that pm @ da~ ba @®b.. This proves the theorem. 
a<Q a<Q 


4. Weakening of the axioms. Consider a system L satisfying Axioms I-V 
as given in $1, but with Axioms II, III restricted to sets of indices of cardinal 
power < &, for some fixed NS > N.. Call such a system an N-geometry. 
For any pair of paren u, v with uw S »v let L(u, v) denote the subsystem of L 
of all elements x withu S z Sv. Thenif L is an N-geometry, it is easily seen 
that L(u, v) is also an -goometey. We shall call a set of elements a,, a eI, 
independent over u, if for some v 2 u, (aa, ae I) 1 in L(y, v). 

If L is an N-geometry and we restrict ourselves to some fixed L(u, v), then all 
those parts of the preceding sections which involve sums and intersections of sets 
of elements of cardinal power < ® will hold exactly as before.’ And if I isa 
set of indices of cardinal power < &, then every set of elements a, , a € J, bg, 


1t The transitivity of perspectivity holds since % > ®,. See [1]. 
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«7, has a sum and an intersection, and hence is contained in an L(u,v). Hence 
we can deduce the following theorems: 


TueoreM 4.1. Let L be an N-geometry and let Q be any limit ordinal such that 
the set of ordinals a < Q has cardinal power < &. If lima. = aand lim b, = b, 


a—Q a—Q 


and if dg ~ ba for alla < Q, thena ~ b. 


TuHeoreM 4.2. Let L be an S-geometry and let I be any set of indices of cardinal 
power <S. If aa, ael, and bs, Bel, are independent over the same u and 


dg ~ ba for alla eI, then bi aq ~ > he 


ael ael 
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THE VARIATION OF THE SIGN OF V FOR AN ANALYTIC FUNCTION 
U+iv 
By M. S. RoBertTson 
1. Introduction. The following theorem is M. Cartwright’s [2]:' 


THEOREM A. Let 
f(2 = > a2” 


be regular and multivalent of order p for | z| < 1 and have q zeros within this circle. 
Then forr <1 


| f(re"*) | < A(p)ug(1 — r)™, 
where A(p) is a constant depending only upon p, and where 
Mg = max {1ao|,|@:|,|@2],---,| el}. 


Recently, with the help of the preceding theorem, M. Biernacki [1] established 
the inequality for the coefficients of f(z) given by the following theorem. 
THeoreM B. With the same hypotheses as in Theorem A, the coefficients of f(z) 
satisfy the inequality 
| dn | < A(p)pgn?™ (n > 0), 
where 


Mg = max {| |, | G2], --- , | Gq |}- 


The following new theorems established in this paper appear to give somewhat 
similar inequalities under a different hypothesis. The theorems to follow below 
overlap Theorems A and B in some cases, especially when f(z) is real on the 
real axis. 

Let fre’) = U(r, 0) + iV(r, 0). Letz = re’ traverse the circle | z| =r 
once, starting at any point z = re’’’ where V(r, %) #0. Ifr < 1, V(r, 6) isa 
continuous function of 6. As z makes a complete revolution around the circle 
from z and back again to z , V(r, @) has either (i) a constant sign or (ii) changes 
sign an even number of times, if we assume that the number of changes in sign 
is finite. We now state the theorems to be demonstrated in this paper. 


THEOREM 1. Let 
a 
f@ = Daz" =U+iV 
0 
Received December 15, 1938. 
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be regular for|z| <1. If there is an interval 0 < 1 — 6 < r < 1 such that V 


does not change sign more than 2p times on|z\| = r for any value r of the given 
interval, then 
(i) |an| < A(p)upn™ (n > 0, p 2 0), 
™ : an E | ak | 
; ais 
ti) lim sup || $222 (p+h!i—h!’ 
(ii) |f(re") | < A(p)up(l — 1)?" (r <0), 
(iv) ~ | \f(re) |d0 < A(p)u,(1 — r)*” (p21,r <1), 
0 


where A(p) is a constant depending only upon p, and where 
Hp = Max { |do|,|@i|,|@2|,--+,|@p|}. 


THEOREM 2. If, in addition to satisfying the hypothesis of Theorem 1, f(z) is 
real on the real axis, then 


(i) lan| < A(p)upn??™ (n > 0, p 2 1), 
(ii) | f(re) | < A(p)up(1 — r)~ (r<1,p2 1), 
(iii) = r | f(re"*) |d0 < A(p)u,(1 — vr)” (r<1,p2 1). 


The function 


2+ a” 


= hix/p 
(l—ejenr ST EE 


shows that in Theorem | the exponents of n and (1 — r) cannot be replaced by 
smaller ones. In the same way the function z’(1 — z) *” fulfills a similar purpose 
for Theorem 2. 

The author [3] previously established Theorems 1 and 2 in part only for the 
special case up: = 0, i.e., when 


% = a = @&=--- = ay. = V0. 


In this special case the method consisted in expressing f(z) in terms of a function 
F(z) regular and having a positive real part throughout the interior of the whole 
unit circle. A modification of this method of proof has been adopted in this 
paper to make the theorems completely general. It was found necessary to 
replace the regular function F(z) by another having a pole of order not exceeding 
p at the origin and which no longer has the property that its real part is positive 
throughout the interior of the whole circle. Instead, F(z) is the limit of func- 
tions with a pole at the origin, regular on | z | = 1 and with positive real part on 
\z2| = 1. This representation for f(z) is perhaps not as interesting in itself as 
in the previous special case. However, it is sufficient to give the desired conclu- 
sion of the theorems above, though the results are not as sharp, as far as the 
constants A (p), «» are concerned, as in the special case wu», = 0. 
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2. Functions with positive real part and a pole at the origin. Before proceed- 
ing to prove Theorem 1, we need first the following lemma. 


Lemma. Let 


(2.1) F(z) = > A," 


k=—p 


be regular for 0 <|z\ < 1 with a pole of order not exceeding p at the origin. If 
RF (re) > Ofor0 <1 —6 <r <1, then 


(2.2) | An + A_n| S 2MAo, 
where A_, denotes the complex conjugate of A_n . 
Proof. 
n l ™ id, —nié 
Aa? = — F(re’)e "” dé (r < 1), 
2m Jo 


~ af 1 Peg 
At * Gio i 7 (ppi0) p—nid 
7 an Jo F (re®)e—"® dé, 


A,r" + A,r” 


2r 
1 [ (mr(re) je" ao, 
T #0 


|Aar” + A_.r”| 


lA 


Qr 
Lf (ger (re)} do = 29%Ao, 
T #0 


forl —5 <r < 1 since RF (re) > 0 in this interval. Letr—1. Then (2.2) 
follows. 


Coro.tiary. If F(z) is regular for 0 <|z\| S 1, and if merely RF(e**) = 0 
for all real 6, then (2.2) holds again. 


3. Proof of Theorem 1. We may assume that p 2 1 in the proof of Theorem 
1. For if p = 0, V(r, ) is of constant sign within the unit circle. Then either 
if(z) or —if(z) has a non-negative real part for|z| < 1. For this case- the 
theorem is well known. 

It will be sufficient to prove Theorem 1 in the case where 


(3.1) g(z) = > Q,2" 

is regular within and on the circle | z | = 1, assuming that the imaginary part of 
g(e"’), or v(@), changes sign 2p times on the circle |z| = 1. If then the conclu- 
sion to Theorem 1 is true for g(z), we may let f(tz) = g(z), 1 — 6 < ¢ < 1, where 
f(z) satisfies the conditions of Theorem 1. Then on|z| = 1 the imaginary 


part of f(tz) does not change sign more than 2p times. On letting ¢ — 1, we 
shall have Theorem 1. 
Since v(@) changes sign 2p times on the unit circle and is continuous, v(@) = 0 





f 
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for at least 2p distinct values of 6. Call these values 0; (¢ = 1, 2, --- , 2p) 
defined so that 


0 Ss & < 2n, 056 < 02 < 03 < +--+ < Oey, 
asad 0 < 6; — 6; < 2x fori >j (i,j = 1,2, --- , 2p). 
Then with a proper choice of 4; , 
v(0) 2 0 for Bo1 < 0 < Oo (s = 1,2, ---, p), 
(83) v(0) SO for O% <0 < Ores) (s = 1,2,---, p), 


where Oep41 = 0, + 27. Let 


aS = 3 (O2p-s+2 ~ 6441) (s _ 2, 3, ver oP 
he = $ (Oep-s42 a 6.41), a= $(0; a 62) (s = 2, 4 eee , P), 
(3.4) > (Oop 242 = 6541), >= 3 (62 — 6) (s — 2, 3, wares ,P); 
$1 = ag — ie 
os = §m — o1 — 2 — - ++ — ben — 3 (Oop-si2 + 9041) (8 = 2,3, ---, p). 
Let 
h,(z) = 2(1 — 2zcosy + 2)", 
g(ze~***) 
H(z) = ‘ 
(3.5) 1@) h,,(—iz) 
Hj-1(ze~**’) ‘ 
F . => ? = - = 2, v0, ee 9 e 
1 ;(z) or p= (j 3 p) 


Each of the functions H; (j = 1, 2,3, --- , p)isregularfor0 <|z| <1. Either 
they each have a pole at the origin whose order does not exceed p, or they are 
regular at the origin. 

(3.6) SH,(e*) = 2v(6 — ¢;)- {sin @ — sin (44 — »)}. 

The second factor of this product is positive for 44 — », < 6 < 4” + » and it is 
negative for $4 + 1 < 6 < $4 — »,. On the other hand, the first factor 


v(6 — ¢;) is positive for }4 — », < 6 < 34 + » as well as for p — 1 other non- 
overlapping intervals wherein the second factor of the product is negative. 


Thus the imaginary part of H,(z) changes sign 2p — 2 times on|z| = 1. A 
similar argument shows that $H2(z) changes sign 2p — 4 times on|z| = 1. 
Finally, $H,(z) does not change sign at all on | z| = 1, and is indeed positive. 


We thus have 


g(z) = (—1)”*H,(—ize”). I hy,(ze*#) 


(3.7) 


2p 


i(—1)? *e* F(z)-2” [] (1 — ze), 


s=1 
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where 


op = 3 >. 8;, F(z) = —iH,(—ize~*), 
(3.8) M 
F(z) = D> Axe’. 


k=—p 


F(z) is regular for 0 <|z| S 1 and RF(e’) = 0. F(z) is either regular at 
z = Oor has a pole of order not exceeding p, depending upon whether some of the 
first coefficients a , a; , --- , a, of g(z) are different from zero or not. 
From (3.1), (3.7) and (3.8) we have, on letting 
2p 2p 


(3.9) I] (1 — ze) = Diez’, 


s=0 


the identity 


2p 20 C) 
(3.10) >> cz" >> Gmz™ = i(—1)” a deat ee 

s=0 m=0 s=0 
whence we have, on equating coefficients of z” from either side of (3.10), 
(3.11) An—p = i(—1)" e+ >> anen—x. 


k=0 


2p 
Since (1 + z)*” dominates Il (1 — ze ***), we have 


s=1 


| (2p)! 
lenl S Ge yin! (n = 0,1, 2, --+ , 2p). 


Using (3.12), we obtain from (3.11) 


(3.12) 


n | ' 
(3.13 |A,_,| S (2p)! | ax | ; 
oe rl $ QP)! 2 Gt op nln! 
Let up = max {|ao|,|a,|,|a2|, ---,|a@,|}. Then forn = 0, 1,2, ---,p 


we have 


— (2p)! 
? =0 (2p — s)!s! 


ee [ (2p)! (2p) ] 


=o (2p — 1)!s! (p!)? 


2p—1 (2p)! | 
. m [2 * 2p] 


| A_n | MK 


(3.14) 


lA 


From (2.2) we have for n > p 


(3.15) |Aa| $ 2|Ao| S np] 2 + ol. 
(p!)? 








he 
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whileforO <<nZp 


: 
(3.16) |Aa| $ 2|4o| + | A iu, | 2 . Go| 
(p!) 
From (3.7) it follows that g(z) is dominated by 
a ». a(Qpt+e-)!, < ” 
(l1—z)”-2, | A,» |2’ = . ‘. Avo. 
) 2 | p | > (2p — 1)!q! ~ 2 | p | 


Consequently, 
“ (2p +q-1)! 

< 

= ~ (2p — 1)!q! 
92 + @P)'). (2p +q-1)! 

3 2 2p 

ia p> (2p — 1)!q! 

< A(p)upn” (n > 0). 


| Qn | | An—p— | 


(3.17) 


IIA 


Again, from (3.13) and (3.15) we have for n > p 


' ‘ ‘ e | ax | 
3. A,| S 2| Ao| S 2(2p)! , 


and g(z) is dominated by 


l—z » ¥ on ait a | 
( ) >| er + qj [200 2 OF HIOL mM 


k=0 


whence 


An 7 | a | 
3.19 sup |—>| =2 
(3.19) lim sup n| = 2X ( (p+ k)!(p — k)! 


This completes the proof of the first two parts of Theorem 1. The third part 


follows immediately from the first part, since for r < 1 


[f(re®) | < DO lan |r” < A(p)up (1 + Lon” 
(3.20) n=0 n=l 


< A(p)u,(1 — =, 
To prove the fourth part of Theorem 1, we write (3.7) as 
(3.21) g(z) = 1( = 1)” "ce °F (z)¢"(z), 


where 


¢(z) = :Ia- “7, 


2g'(z) _ 1 > 0. + =) 
$(z) 2p 1 \1 — ze-*]’ 











518 M. S. ROBERTSON 


whence Rzo’(z)/¢(z) > 0 for|z|< 1. Thus ¢(z) is univalent and star-like for 
|z| <1. It follows forr < 1 that [4] 





1 f* eee? yc 1 ¢ dB 
(3.22) 2r Jo r = Ondo [1 — rel | 
< A(t =) oz0 


Also we have by (3.14), (3.15) and (3.16) 


max | 2’ F(z) | < = - 7 4 
(3.23) lor ee: 
s A(p)up(1 — r). 


Combining (3.22) and (3.23), we have for p 2 1,r < 1, 


= | | g(re™*) | do 


~f |2F(@) |-|% dé 


Qe | 10) |p 
(3.24) A(p)up 1 ore’) a9 


l—r 2x Jo r 
< A(p)u,(1 — 1)”, 


where A(p) is a constant depending only upon p. This completes the proof of 
Theorem 1. 


IIA 


4. Proof of Theorem 2. It will be sufficient to prove Theorem 2 for 


(4.1) g(z) = >. anz”, dy, real, 
0 


when g(z) is regular for | z| < 1, g(z) real on the real axis, and when the imagi- 
nary part of g(e"’) changes sign 2p times on the unit circle. Since the coefficients 
are real, we have (if not for g(z) then for —g(z)) 

4, = 0, On41 = 7, Bop—s+2 = 27 — 6, (s = 2, 3, eee, Pp), 
(4.2) 2p P 
or = 426 = te + Li be + 4.41 — 6.)] = 2p — Ie. 


From (3.7) we then have 


(4.3) giz) = =. price 
II G — 2z cos 0; + 2’) 
7=2 


Thus g(z) is dominated by the function 
(4.4) (1 — 24) *(1 — 2)? ”- DU | Aap le” 
s=0 


which on account of (3.14), (3.15) and (3.16) is in turn dominated by 


1+ ‘) = A(p) up 
( 


l—s 1 — z)?" 


(4.5) (i — 2) "(1 — 2)’ Aa ( 








for 
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Thus 
(4.6) |an| < A(p)upn”?™ (n > 0, a, real). 
- 16 A(p)up 

(4.7) \f(re") | < Tn 
To prove the third part of Theorem 2 let 
(4.8) ¥(2) = -— : — (p> 1), 

| 1 (1 — 22 cos »; + “| 

j=2 
2y’(z) 1 . if 1-2 } 
9 ) = ) 0 z| <1), 
(4.9) Ge) “= li = 22 cos »; + 2 - (el <1) 
_ 2°F(z) fy(e)\?* 
(4.10) g(z) = ~ "| : ‘ 
where ¥(z) is star-like for |z | <1. Hence forz = re’, p > 1, 
a A(puy 1 [| 1 | lve |r 
2r 0 |g(2) |d < l—r 2x 0 1] — 2? Z ¢ 

Pr ihe | L Lr dé ee iZ v(2) ao} 
(4.11) l—r \2r/o |1—2/? 2m Jo Zz 

< 4 ana | A )( A(p) ) 

l1—r\l-r (1 — rj‘? 
< A(p)u,(1 — vr)” (p> 1,r < 1). 


We shall show that this last inequality is also true when p = 1. Since a and a, 
are real, and a, ¥ 0 for p = 1, the imaginary part of [g(z) — g(0)]/a; is the same 
as for the function g(z)/a,; and is positive only when the imaginary part of z is 
positive. By a theorem on typically real functions [4] we then have 

r| ai | 


1 [ - a - 
(4.12) roe | |g(re") — g(0) | dé Ss —- 


whence (4.11) follows for p = 1. 
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CONVEXITY IN A LINEAR SPACE WITH AN INNER PRODUCT 
By Davin Moskovitz AND L. L. Dings 


1. Introduction. In Euclidean space, a point set Yt is said to be linearly 
connected (or convex) if, when x = (a, 22, ---,%n) and y = (y1, Y2, «++ , Yn) 
are any two points of the set, all points on the segment joining z and y also belong 
to the set. Analytically: 


(1) If x and y are in M, then kx + (1 — k)yisinMfor0 Sk J 1. 


A line (in two-dimensional Euclidean space) or a hyperplane (in n-dimensional 
Euclidean space) is said to be a supporting line or hyperplane of a set QM if it 
passes through at least one point of Mt and does not separate the points of M. 
A set 2 such that through each of its boundary points there can be passed a 
supporting hyperplane is said to be completely supported at its boundary points. 

In Euclidean space, the following theorems are well known.’ 

TuHeorem A. If a point set is linearly connected, it is completely supported at 
its boundary points. 

TueoreM B. If a point set is closed, possesses inner points, and is completely 
supported at its boundary points, then it is linearly connected.” 

The purpose of this paper is to examine how far these theorems persist in a 
more general space. Obviously their validity will depend on the space and the 
definitions assumed. 

We center our attention on a space © which is linear and in which an inner 
product is defined. That is, for any two elements z and y of our space there is 
assumed to exist a unique real-valued number denoted by ((z, y)) and called the 
inner product of xand y. This inner product is assumed to satisfy the following: 


((ex, y)) = e((z, y)) for every real number c¢, 
((x + y, 2)) = ((z, z)) + ((y, 2)), 

((z, y)) = ((y, z)), 

((x, z)) 2 0, 


IV 


with the equality holding if and only if x is the zero element of our space, and 


| (x, y)) | S (x, z))*-((y, y))’. 


Received January 18, 1939; presented to the American Mathematical Society, September 
6, 1938. 

! See, for example, American Mathematical Monthly, vol. 45(1938), p. 202. 

2 For a symmetrical theorem, but one which is weaker than Theorems A and B, we have 
the following: /f a set M is closed and possesses inner points, then linear connectedness of the 
sel M implies complete support al its boundary, and conversely. 
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If we denote the non-negative real number ((z, z))' by || z ||, the function 
|| 2 — y || of the two elements z, y has all the necessary properties for a distance 
function and serves as a metric for the space. In particular, the important 
triangle inequality 


lez+yll silzil+iiyil 


is satisfied. Convergence in terms of this metric is defined in the usual manner, 
and the space S thus metricized is assumed to be complete. 

We generalize the Euclidean hyperplane in what appears to be a natural 
manner; namely, if a is a point of our space such that || @ || + 0, the points z 
which satisfy an equation of the form 


((a, x)) = constant 
will be said to form a plane.’ In particular, the plane 


((a@, x)) ” ((@, X)), 
or what is the same thing 
((a, - Zo)) _ 0, 


will be called a plane through the point 2 . 
The plane 


r(x) = ((a,z)) —c = 0 


determines two closed half-spaces in S, one consisting of those points z for which 
x(x) = 0, the other of those for which r(x) S$ 0. The corresponding open half- 
spaces are represented if the equality signs be omitted. 

Any set 92 serves to classify all planes of the space © into three categories 
with associated names as follows. Relative to the set Mt, a plane 7 is called 
(1) a bounding plane if all points of M lie in one of the open half-spaces deter- 
mined by x, (2) a separating plane if points of MM lie in each of the open half- 
spaces, or (3) a supporting plane if all points of M lie in one of the closed half- 
spaces and at least one point of J lies on zx. 

Let M denote a set of points in the space S. An element of S which is not a 
point of I will be called an exterior point of M; a point of M which is the limit of 
exterior points will be called a boundary point of I; and a point of M which is 
not a boundary point will be called an inner point of M. 

A set I2 will be said to be linearly connected if condition (1) is satisfied. 

Assuming these definitions, we shall prove that Theorem B generalizes com- 
pletely (Theorem 6) but Theorem A does not. An example is given in §4 which 
exhibits a linearly connected set which has boundary points through which no 
supporting plane exists.“ 

* For the sake of brevity, we will use the term plane instead of hyperplane. 

‘In a paper published in Annali di Matematica, vol. 10(1932), entitled Sugli spazi 


lineari e loro lineare varieta, Ascoli shows, with a different definition of a plane from the 
one we use, that in a separable space a linearly connected set which possesses inner points 
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Since Theorem A does not generalize, it becomes desirable to characterize 
those boundary points of a linearly connected set through which supporting 
planes exist. We call such boundary points normal points, and show that they 
are projections (to be defined below) of exterior points of the set M. If yo is an 
exterior point of the set M, we will call the distance from yo to M the greatest 
lower bound of the aggregate of values of || yo — x || as x varies over M. If the 
distance from yo to PM is attained at a point 2% of M, we will call xo a projection of 
yo on J. We shall show (Theorem 11) that a linearly connected set M is 
supported at those boundary points and only those which are the projections of 
exterior points of 92; and (Theorem 14) in case the set I is closed as well as 
linearly connected, then the set of such normal points of J? is everywhere dense 
on the boundary. 

In §2, we establish some useful geometric properties of the space S which are 
needed in the later developments. We close our work (Theorems 15, 16, and 
corollaries) by showing that, with certain additional assumptions on our space, 
Theorem A generalizes completely. 


2. Some geometric properties of the space ©. 

THEOREM 1. The projection of an exterior point yo upon the line of points 
(2) t= Xo oo t(2, = Xo) 
is unique and is obtained when the parameter t has the value 


— ((Yo — 20, t1 — %)) 
\| t1 — 2oll? 


(3) t 


Proof. From the definition of || yo — x || it follows that 
yo — x\\' =|! yo — xol!) + || 1 — ao||” — 2((yo — 20, r1 — %)), 
and from elementary considerations it follows that this quadratic in ¢ has its 
minimum value when ¢ has the value (3). 


THEOREM 2. The projection of the exterior point yo upon the plane 
(4) a(x) = ((yo — 2,2 — %)) = 0 


is the point x». 


Proof. If x; is any point of the plane distinct from x, the projection of 
yo upon the line (2) is given by the parameter value (3). Since 2; satisfies (4), 
this parameter value reduces to zero, and the projection is 2 . 


(a convex body) is completely supported at its boundary. However, in this paper we do 
not assume our space to be separable and we do not restrict our set I? to be a convex body. 

Since the present paper was submitted for publication, we have been able to prove 
without the assumption of separability that a linearly connected and closed set which 
contains inner points is completely supported at the boundary. 
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THEOREM 3. The projection of a point yo on a plane 
r(x) = ((a,2))-b=0,  jla|| = 1, 
is unique and is given by 
Lo = Yo + ha, where tt = b — ((a, yo)). 


Proof. It is easily verified that 2 is on the plane x. Suppose z; is any other 
point on 7; then 


Il yo — ai ||’ = || yo — to + 2 — 11 || 
= || yo — 20 ||? + || zo — 21 ||? — 2to((a, 2 — 21)) 


112 r 1)2 
ll yo — to|f + |\la— ail, 


the last equality following from the fact that both z) and x, areon 7. Hence 
ll yo — a1 || > || yo — ol, 


and 2 is the projection. 

The following corollary is easily established. We give it without proof, since 
it is a special case of a more general theorem (Theorem 9) which is established 
later. 


Corotuary. If 2% is the projection on a plane m of the point yo, it is also the 
projection on x of any point on the line xoyo . 


THEeorEeM 4. Through a given point x there is one and only one plane on which 
% is the projection of a given point yo. Its equation may be written 


w(x) = ((yo — 2,2 — %)) = 0. 


Proof. That 2 is the projection of yo on this plane was shown in Theorem 2. 
To prove the uniqueness of this plane, let us suppose that 


n(x) = ((a, x = Xo)) ” 0, || @ | _ l, 


is any plane through 2 on which 2 is the projection of yo. Then by Theorem 3 
it follows that 


Lo = Yo + ha, 
where & is a well-defined real number. Hence 
- m. (yo -_ Zo) 
7 ty 
and 


x'(z) = — ira); 


that is, the planes x’(z) = 0 and x(x) = 0 are identical. 








524 DAVID MOSKOVITZ AND L. L. DINES 


Derinition. The set of points x for which || x — 2 || S r will be called a 
sphere of radius r about the point z. The set of points for which the equality 
holds will be the boundary points of the sphere. 

It is easily verified that the sphere is a linearly connected set. 

The following lemmas are needed in the proofs of Theorem 5 and subsequent 


theorems. 
Lemma 1. If 2 is an inner point of a set M, there is a sphere about x» in which 
every point belongs to M. 


Proof. Suppose this were not true; then corresponding to any preassigned 
\ 


monotone decreasing sequence of positive numbers {r,} there would be a 
sequence of exterior points {z,} such that 

\| zo — 2a|| < Ta. 
The greatest lower bound of the sequence {r,} cannot be zero for if it were, 1% 
would be a limit point of {z,}, in which case x» would be a boundary point of M. 
If the greatest lower bound of the sequence {r,} is the positive number r, then 
in a sphere of radius less than r about 2 there are no exterior points of J. 


Lemma 2. A supporting plane of a set It cannot contain an inner point of M. 
Proof. Let 
a(x) = ((a, z)) —-b = 0, \|a || = 1, 
be a supporting plane of I. Let 2» be an inner point of M which, if possible, is 
on the plane z, and let r be the radius of the spherical neighborhood about » 
each point of which belongs to Jt. Then the points 
= %+ra and x= %— Ta 

arein Jt. But 
((a, 2 + ra)) — b 


II 
= 


a(x) 
and 

m(te) = ((a, % — ra)) —b = —F. 
Consequently, x; and x2 are separated by the plane 7, and the hypothesis that « 


is a supporting plane of M2 is contradicted. The contradiction proves that % 
cannot be on a supporting plane. 


TueoreM 5. Through each boundary point of a sphere there passes one and only 
one supporting plane. It is that plane on which the boundary point is the projection 
of the center of the sphere. 


Proof. If the center of the sphere is yo and its radius r, then the boundary 
point 2» may be expressed in the form 


(5) o= Yot rh, || wo || = 1. 
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That the plane 
w(x) = ((yo — %0, 2 — %)) = O 


is actually a supporting plane may be verified as follows. Substitution for 2 
of its value from (5) in r(x) gives 


(6) a(x) = ((—ruw,2 — yo — rw)) =r — A, 
where 

A = r((ue, x — Yo)). 
Since 


|A| Srl woll-\|z — yl] =r|iz— yl, 


it follows from (6) and the definition of the sphere that, for all points z of the 
sphere, we have x(x) 2 0. 

The uniqueness of the supporting plane follows immediately from Theorem 4 
in view of the fact that x» must be the projection of the center yo. If some point 
a ¥ 2 were the projection of yo , its distance from yo would be less than r, and 
so it would be an inner point of the sphere. But, by Lemma 2, an inner point 
of a set cannot be on a supporting plane of that set. 


3. Generalization of Theorem B. 


THEOREM 6. Jf I is closed, possesses inner points, and is completely supported 
at its boundary points, then it is linearly connected. 


Proof. Suppose 9 is not linearly connected; then there is a segment joining 
two points x, and zz of I on which there lies a point y exterior to Mt. We 
choose some interior point 23 of I and denote by v the (certainly existing) bound- 
ary point on the segment joining x; to y. From their descriptions, y and v may 
be expressed in the forms 


y = kx + (1 — k)are, v = k’'y + (1 — k’)as 


where 0 < k < land 0 < k’ < 1, and from this it follows that v may be ex- 
pressed in the form 


(7) V = AX, + AeXe + A323, 
where 
(8) a, > 0, a, > 0, as > 0. 


Now, by hypothesis, there exists through the boundary point v a supporting 
plane x(x) = 0, the definition thereof necessitating that r(v) = 0, while no two 
of the real constants 2(2x,), +(22), (xs) differ in sign. But by (7) 


m(v) = ayr(x1) + Ger(X2) + asr(2s), 
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or 
to 


and so, in view of (8), 
(2X1) = 1(X2) = a (23) = (), 


But x3 is an inner point of I, and the condition x(x3;) = 0 implies that x3 is ona 
supporting plane, and this was shown in Lemma 2 to be impossible. 
Therefore our original supposition was incorrect and YJ is linearly connected. 
For a stronger theorem than that given in the preceding, we have the following. 
THEOREM 7. If Mis closed, possesses inner points, and is supported at a set of 
points which ts everywhere dense on the boundary, then Wt is linearly connected. 


Proof. Let x; and x2 be two points of 2 on whose segment, if possible, there 
. . . / . . . 
is an exterior point y. Let 23 be an inner point of Yv; then there exists a bound- 
P P . i , 
ary point v’ on the segment joining x3 toy. Let 


D = || a3 — y|| and d = || v’ — y||. 
Then 
' D 
t3—y=-;(v’ — y), 
d 
and 


, D 
m=yt+— (v’ — y). 


. . , . . 
Let r be the radius of the neighborhood about 2; for which all points belong to NM. 
For any preassigned « > 0, there exists at least one normal boundary point v 
such that ||}v — v’ || < «. Choosee=rd/D. Let 
v=v' +c, 


where | A || = 1, ande < rd/D, be such a normal point. Let 
D 
s=yt--(v—y); 
d 


then z; is on the line joining y to v but not on the segment yo since D/d > 1. 
Also 
D ; D 


= \le-» l= are <r. 


D 


/ 
i3—~ 73 *= — 
d 
Hence z; is an inner point of I, and the conditions upon the points 2 , 22, 2, 
y, and v are precisely those upon the similarly named points in Theorem 6, 
and from these followed the contradiction. The same argument therefore proves 
that WY is indeed linearly connected. 


Ul 
(vu—v'); || xs — 2s || 


4. On the generalization of Theorem A. 


THeoreM 8. If the exterior point y has a projection on the linearly connected 
set IN, this projection is unique. 
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Proof. Suppose x; and z2 were distinct projections of y on Jt. Then the 


point 
z= 3(%1 + 22) 


would be distinct from x; and z2 but would belong to M, and we easily obtain the 
impossible conclusion 


(9) iv —2i <Us=— ai. 
For 

lly—all=||ly-z2et2-a|/ 
(10) | 112 12 

= ily —-2]| + ||z — a || + 2P, 
where 
P = ((y — 2,2 — %)) 

(11) = ((}ly — a] + 4ly — 2], ly — a] — 4ly — 22])) 


Hily — al? — \ly — zl] = 0. 


From (10) and (11) follows the contradiction (9), and the contradiction proves 


the theorem. 


DeFINITION. If 2, is an exterior point of M and 2 is its projection, the set of 
points z. of the form 


(12) Le = X + c(X — I) (c > 0) 
will be called the projector of x; on M. 


THEOREM 9. If, on a linearly connected set It, xo is the projection of an exterior 
point x, , it is likewise the projection of all points on the projector of x, . 


Proof. Let z., given by (12), be a point on the projector of 2. 
We first prove the theorem for the case 0 < ¢ < 1, by showing that the 
existence of a point y of I for which 


(13) | te — y ll < |] te — 2o]| 

implies the contradiction 

(14) lai — y || < lla: — oll. 
By a sequence of simple reductions we have 


lla —yll S lla — xe ll + lle — yl 








< || a1 — Ze || + || Ze — roll, by (13) 
= || a1 — Xo — c(t — Io) || + || e(t1 — Xo) ||, by (12) 


= ||t1 — xl, 


and the contradiction (14) follows. 
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In case c > 1, from the definition of z. , we have 
a = % +c (rt. — 2) O0<c'< 1), 


If the projection of z. on It were y, then from the result of the first case the 
projection of z; on J would also be y. But the projection of z; on M is x, 
and since this projection is unique (Theorem 8), we must have y = 2» ; hence hee 
projection of z. on M is 2% . 


THEOREM 10. If xo is the projection of an exterior point yo upon the linearly 
connected set IN, then 


(x) = ((Yo — %, 2 — %)) = 0 
is the equation of a supporting plane through 2 . 


Proof. It will be sufficient to show that x(x) < 0 for all points zx of M. To 
this end, we suppose that there is a point 2, of M for which x(x) > 0 and obtain 
the contradictory result that 2 is not the projection of yo on M. 

Since M is linearly connected, points of the line 


(15) zr=%mt+ t(x, —- Xo) 


corresponding to parameter values 0 < ¢ S 1 belong to M. It is easily verified 
that yo cannot belong to the line (15) without contradicting one of the conditions 
x(yo) > 0, yo is exterior to M, or || yo — Zo || < || yo — 2: ||. Hence no point of 
the projector 


(16) Y = Xo + (Yo — X) (e > 0) 


with the exception of the projection 2» itself lies on (15). We may therefore use 
formula (3) to find the projection of any point y of (16) on the line (15). The 
result is 
Dy = % + €A(x, — %), 

where 
= ((yo — Zo, 1 — Zo) - (2x1) 

|| %1 — Zo ||? [| 21 — 2o||? 
Since x(z,) is by assumption positive, ¢€ can be chosen positive and so small that 
p, belongs to the segment of (15) in IN; we so choose «. Now the distance 


7 — Zo || is greater than the distance ||y — p,||. For 
Y— % = (y — Py) + (py — 2), 
iv7T=- to ||’ =|ly - Py || + || py — to ||’, 


because, as may be easily verified, 
((y — Py, Py — Xo)) = e [A ((yo -~ s, & — o)) — A’ \| ay ss Xo ||") = 0. 


Thus, zo is not the projection of y on Jt. By Theorem 9, 2 is therefore not the 
projection of yo , and the desired contradiction is reached. 
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THEOREM 11. A linearly connected set It has supporting planes at those, and 
only those, of its boundary points which are the projections of exterior points. 


Proof. If xo is the projection of an exterior point on QM, the existence of a 
supporting plane through 2 follows from Theorem 10. 

Suppose, conversely, that x is a boundary point through which passes a 
supporting plane 


(x) = ((a,  — %)) = 0, || a || # 0; 
and suppose for definiteness that 
x(x) £0 
for all points xz of M. 
If we define yo by the relation 
Yo = % + a, 


then 
llyo — 2 ||? = ||am —2+a|l? = || a — 2\|+ || a ||? — Q(z) 


which, for all z belonging to MM, is obviously non-negative and attains its mini- 
mum value || « ||’ when and only when z = 2%. Therefore 2 is the projection of 
yoon M. 

We now state and outline the proof of a theorem which will be useful in 
further developments, and which has perhaps some inherent interest. Some 
tedious algebraic details are omitted in the proof, but they are similar to others 
which have appeared earlier in the paper. 


THEOREM 12. Jf is a linearly connected set of points belonging to the ‘spherical 
shell” defined by 


rs|jxr-—al|srt+h, 
then the distance between any two points of N does not exceed 2(2rh + h’)!. 


Proof. If 2, and 22 are two points of N, the line segment between x; and 2; 
certainly does not pierce the inner sphere. However, the entire line, of which 
this segment is a part, may or may not pierce this sphere. It is accordingly 
desirable to divide the proof into two cases depending upon whether the condition 


(17) lla, + k(ze — 21) —all|2r 


is satisfied for all real values of k, or only for some including certainly 0 S k s 1. 
Case Il. When (17) holds for all real k. 
We denote by w the projection of the center a upon the line 2,2; (Theorem 1), 
and by u the intersection of the projector aw with the boundary of the sphere 
z—a\| Sr. By Theorem 5 there exists through this boundary point u a 
supporting plane of the sphere, and its equation may be written 


a(x) = ((u—- a,x — u)) = 0. 
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Simple considerations then verify that 


m(%i) 20 and -x(2z2) 2 0. 


These results will be used after a consideration of Case IT. 

Case II. When (17) does not hold for all real k. 

In this case there will be a parameter value k = k, for which (17) will not 
hold and a corresponding point 


y=ut ky (x2 “ 21) 


of the line 2,272 which lies in the inner sphere. The parameter value k; will be 
less than zero or greater than one according to the relative rdles played by 2; and 
te. Wewill assume that k,; < 0, and this means (geometrically speaking) that 
2, is closer than zz to the inner sphere. 

We denote by wu the intersection of the line az; with the boundary of the inner 
sphere |! — a|| S r, and write the equation of the supporting plane of the 
inner sphere through u in the form 


a(x) = ((u— a,x — u)) = 0. 


Again, simple considerations, depending upon the positions of a, y, z;, and 2 rela- 
tive to the supporting plane and upon the assumed sign of k, , enable us to show 
that 


m(zi) 20 and r(x) > 0. 


In either Case I or Case II we find an upper bound for the distance || z2 — 2 | 
as follows. Since 


2 


az — al! = |\a;, — ull? + || u — all? + (2, — u, u — a)) (¢ = 1, 2), 


we have, by transposition and substitution, 


< (r +h)’ — 9° = Qrh + Ph’. 
Therefore we have, since 
lt — || S ||a2— ull + ila — ull, 
the desired conclusion 
\| z2 — 21 || S 2(2rh + hr)! 


TueoreM 13. If the linearly connected set IQ is closed, every exterior point of M 
has a unique projection on WM. 


Proof. It is only necessary to show that every exterior point has a projection 
on ¥; it will then follow from Theorem 8 that this projection is unique. 

Let y be an exterior point of I2 whose distance from IM is equal to d; it is 
necessary to show the existence of a point x in I for which ||z — y|| = 4 
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Select a strictly monotone decreasing sequence of positive numbers {e,} with 
limit zero. Corresponding to each e, there exists an z, in J? such that 


ds||t—yllSdtan. 


Corresponding to any preassigned positive number e, there exists an index m 
such that 


2(2dem + é)! ¢¢ 


and such that the points z, for n 2 m all belong to the linearly connected 

aggregate common to Jt and the sphere || z — y || S d + em, which aggregate 

must in fact lie wholly within the spherical shell d S || z — y || Sd+ em. 
Hence, by Theorem 12, we have 


| tn — Insp || S 2Qden +)’ <e (n= m;p = 1, 2,3, ---). 


The sequence {z,} therefore converges; and because of the completeness of the 
space S and the closure of the set I, there exists an element x of J such that 
lim ||2 —2z,|| =0. Now 


ano 


z—y = (rt — zn) + (2a — y), 


and 
[\]2— tall — lla -—ylll Silz—-—yll Sllz—20|| + lla. — 9 |]. 
Since 
lim || z. —y|| =d and lim || z — z, || = 0, 
n—>00 n—-e0 


it follows that 
lz —yl||=d. 


In order to show that the linearly connected set 9 is completely supported at 
its boundary, it would be necessary to show that each boundary point of J? is the 
projection of some exterior point. That this is not true even for linearly con- 
nected and closed sets can be illustrated by the following 

Example. Let S be the space of functions whose squares are Lebesgue 
integrable on the interval0 St <1. Let 


((a, y)) [ ax(t)y(t) dt. 


Let M@ be the set for which | z(t) | < ¢. It is easily verified that M is linearly 
connected and closed. Also x(t) = 0 is a boundary point of WM, for it may be 
approached by the sequence of exterior points {y,(t)}, where 

y(t) = n° forrO Sts on", 
and 
forn' <t 31. 


II 
— 


yn(t) 
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But the set J has no supporting plane through the boundary point zo(¢). For 
suppose that there were such a supporting plane given by 


e(2) = [ * a(t)x(t) dt = 0. 


Since || a || # 0, there is a set of points E of positive measure on the interval 
0 < t S 1 for which a(t) = c > O or else a(t) S —c < 0, where c is some fixed 


number. For definiteness assume the former case true. Consider the points 
z(t) and 2x2(t) belonging to Mt defined as follows: 

a(t) =t onE and 2,(t) = 0 on C(E), 

a(t) = —t onE and z(t) = 0 on C(E), 
where C(£) denotes the complement of £ on the intervalO S$ ¢ S$ 1. Then 
m(x,;) > O and (zz) < 0. Hence x(x) = 0 separates points of M2. Therefore, 
there is no supporting plane through zo(t). 

Though a linearly connected and closed set Pt is not necessarily completely 
supported at its boundary, we can obtain a result on the distribution of the 
boundary points at which the set is supported. This result is given in the 
following theorem. 


TueoreM 14. The set of points at which a linearly connected and closed set M 
is supported is everywhere dense on the boundary of M. 


Proof. It is necessary to show that if 2» is any boundary point of PM, then 
corresponding to any preassigned « > 0, there exists a normal point x such 
that || 2; — to || <«. Let y; be an exterior point of M2 such that || yi — 2 || < «, 
and let the projection of y, on I be x; (Theorem 13); then 2, is a normal point 
(Theorem 11). The plane x(x) = 0, where 

mx) = (yi — 1,2 — N)), 


is a supporting plane of I through 2,. Since x(y:) > 0, we must have 
x(x) = 0. Now 


¥i-— m= (Yi — zi) — (to — 2%), 

and 
Ilys — ao |? = |] yx — aa |? — 2x(ao) + || zo — 24 ||’. 

Hence, 
\| zo — ai ||? = Ilys — zoll? — lly — 21 ||? + 2m(ao), 
from which it follows that 

\| tw — a ||’ < linn — aol’ < 5 
consequently, 


\| to — 2 || “€ é& 
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THEOREM 15. Let 2 be any boundary point of a linearly connected and closed 
set IM, and let {y,} be a sequence of exterior points having xo as their limit point. 
Let x, be the projection of y, on IM, and let 


Yn — In 


(18 a= ‘ 
~“tn-al 


7 “25 
Then if” from the sequence {a,} a subsequence |an,} can be selected so that for some 
element a in S we have 


(19) lim ((an,, 2)) = ((a, z)) for every x in S 
and 
(20) | a || #0, 


then 2 is a normal point of M. 


Proof. The planes 
m(t) = ((an,, 2 — Ln,)) = 0 


are supporting planes of 2 (Theorem 10). Let y be any point of I; then 


(21) m(y) S 0 for each k. 

We may write 

(22) wey) = (Cn » Y — Zo)) + me(20). 
Since 


lim ((an,,y¥ — %0)) = ((a, y — 20)) 


k-e00 
by (19) and 


lim az(xo) = 0, 


k-+00 
we have from (22) 


lim my) = ((a, y — 2Xo)). 


But because of (21), it follows that 
((a, y — %)) $0. 
Consequently, 
mo(xz) = ((a,  — %)) = O 


is a supporting plane of YP? through 2 , since mo(y) S 0 for every yin Mt. There- 
fore, zo is a normal point of M. 


* In particular this condition will be satisfied if the aggregate of boundary points of the 
unit sphere in S is weakly compact (Banach, Théorie des Opérations Linéaires, p. 239). 
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THEOREM 16. Let x be any boundary point of a linearly connected and closed 
set M, and let {y,} be a sequence of exterior points having xo as their limit point. 
Let x, be the projection of y, on M and let a, be given by (18). Then, if from the 
sequence {a,} a subsequence {a,,} can be selected so that for some a in S we have 


(23) lim || an, — a || = 0, 


k-—+2 
Zy ts a normal point of M. 
Proof. The condition (23) implies (19) and (20). Hence the hypotheses of 
Theorem 15 are satisfied, and zo is a normal point of WP. 
From the preceding theorem, the following corollaries are immediately de- 


ducible. 


Coro.tiary 1. A linearly connected and closed set It in a compact space is 
completely supported at its boundary. 


Coro.uary 2. A linearly connected and closed set MQ in a finite-dimensioned 
space is completely supported at its boundary. 


CARNEGIE INSTITUTE OF TECHNOLOGY. 
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FIXED SETS UNDER HOMEOMORPHISMS 
By J. L. KeLiey 


As a generalization of the Scherrer Fixed-Point Theorem’ W. L. Ayres has 
shown’ that a homeomorphism 7’ which carries a locally connected continuum M 
into a subset of itself carries some cyclic element C of M into a subset of itself. 
This result yields a fixed point theorem in an acyclic space or under certain other 
conditions. In this paper the following extension of Ayres’ theorem to non- 
locally connected continua will be proved: 


TurorEM I. If T is a homeomorphism carrying a compact continuum M into a 
subset of itself, then there exists a subcontinuum H of M, with T(H) = H and such 
that H contains no cut points of itself. 


This will be deduced as a consequence of the following series of lemmas. 


Lemma I. If M is a compact continuum and T is a continuous transformation, 
oo 


T(M) CM, thenz = Il T'(M) is an invariant continuum under T, i.e., T(x) = x. 
i=] 

Proof. Since T(M) C M, T**\(M) C T"(M) and therefore x is a continuum. 
If p ex, 7 '(p) intersects every set T°(M) and hence also x. Therefore p ¢ 7'(x) 
and  C T(x). If pex, peT'(M) for all i > 0 and hence T(p) « T*"(M). 
Therefore 7'(p) ¢ x and T(x) Cr. 

Without further proof we may also state 


Lemma I’.* Jf T is a homeomorphism and T'(M) C M, other conditions being 


the same as in Lemma I, then x = II T ‘(M) is an invariant continuum under 7. 
i=1 
Lemma II. The property of being an invariant continuum under a continuous 
transformation T is inducible. 


Proof. Suppose x = II C; with C; a continuum, 7'(C,;) = C; and Cin CC; 
i=] 

foralli > 0. Then “pa point of x” implies “p, and hence 7'(p), in all C,’”’; thus 

T(p) ex. It follows that 7(7) C wr. Furthermore, the inverse of every point 


p em intersects every C; and hence intersects r. Therefore pe T(x) and x C T(r). 


Received January 20, 1939. 

' Uber Ungeschlossene stetige Kurven, Mathematische Zeitschrift, vol. 24(1925), pp. 
125-130. 

? Some generalizations of the Scherrer Fixed Point Theorem, Fundamenta Mathematicae, 
vol. 16(1930), pp. 332-336. See also K. Borsuk, Einige Sdtze iiber stetige Streckenbilder, 
Fundamenta Mathematicae, vol. 18(1932), pp. 198-213. 

* If 7 is not a homeomorphism, the process defined here leads, by a lemma similar to 
Lemma II, to a collection of invariant sets not necessarily connected. These will be 
subsets of (not necessarily irreducibly) invariant continua. 
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As a result of the previous lemmas we have, by the Brouwer Reduction 

Theorem, the following. 
4 ° ° m+ ° ° , 

Lemma III.” Jf M is a continuum and T is a continuous transformation T(M) 
Cc M, then there exists a continuum N C M invariant under T and irreducible with 
respect to the property of being an invariant continuum under T’. 

We remark that, from Lemma I, it is impossible that the continuum WN of 
Lemma III contain a proper subcontinuum C such that 7'(C) C C, and if 7 isa 
homeomorphism, it is also impossible that T(C) > C. 


Lemma IV. Jf T(M) C M is a homeomorphism, no irreducibly invariant 
subcontinuum of M can contain a cut point of itself. 


Proof. Suppose the contrary. Let N be an irreducibly invariant sub- 
continuum of M with N = A + B, and A and B continua, the product AB 
being a single point p. Suppose T(p)«A. For xe A, define ¢(x) = T(z) 
provided T(z) «A, and ¢(x) = pif T(x) eB. Then ¢ is a continuous trans- 

2 
formation and ¢(A) C A. Let r = II ¢@'(A). Then =z is an invariant con- 
i=l 
tinuum under ¢. Suppose first that > p. Then on 7, ¢ is identical with T 
and is fixed under 7’, and this is a contradiction. Suppose then that p ez; 
then also T(p) ex. T(x) is a continuum containing 7(p) « A, and since 7'(x) 
contains all points of « — p, it contains also p. Hence T(x) > x. But it is 
impossible that any subcontinuum of N be contained in its transform. We have 
then a contradiction. 

The theorem then follows without further proof. 

We now make some applications of our theorem. According to G. T. Why- 
burn’ if M is a continuum, a subcontinuum N of M will be said to be a 0-th order 
cyclic element of M, or simply an Ep set provided N is a maximal with respect to 
the property of being a subcontinuum without cut points. We may now state 


TueoreM II. Jf M is a continuum and T is a homeomorphism, T(M) C M, 
then there exists either a fixed point in M or else a set Ey < M such that T(Eo) C Ey. 


Proof. An irreducibly invariant subcontinuum H of M is evidently either a 
point or is contained’ in a set Ey. If H is non-degenerate, then, since the 
transform of the set 2» containing H is a set of the same type, it follows that 
T(Eo) C Ey. 

We shall say that a set H has the fixed point property for homeomorphisms 
provided every homeomorphism 7(H) C H leaves one point fixed. From the 
previous theorem, a homeomorphism on a continuum M, 7(M) C M, implies 
the existence of a set Ey such that T7(£)) C Ey. We may then state 


‘It is to be noted that Lemmas I, II and III are proved for any continuous trans- 
formation. 

‘See Concerning maximal sets, Bulletin of the American Mathematical Society, vol. 
40(1934), pp. 159-164; also Cyclic elements of higher order, American Journal of Mathe- 
matics, vol. 56(1934), pp. 133-146. 
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ml 6 * . ~ ° 

THEOREM III.” Jf every set Eo in a continuum M has the fixed point property 
for homeomorphisms, so also has M; i.e., this fixed point property is Ey extensible. 

Since, if M is locally connected, the sets Ep are simply the cyclic elements of 
M, we may deduce from the two previous theorems the following known theo- 
rems (see footnote 1). 


THEOREM IV. If a homeomorphism T carries a locally connected continuum M 
into a subset of itself, then there exists a cyclic element C of M such that T(C) CC. 


THEOREM V. The fixed point property for homeomorphisms is cyclically ex- 
tensible. 


UNIVERSITY OF VIRGINIA. 


6 This result appears to be related to that of O. H. Hamilton, who has shown that a 
hereditarily unicoherent and hereditarily decomposable continuum has the fixed point 
property for homeomorphisms (see Transactions of the American Mathematical Society, 
vol. 44(1938), pp. 18-24). However, Hamilton’s theorem neither includes nor is included 
in Theorem III, for one can construct a hereditarily unicoherent and hereditarily decom- 
posable continuum which is without cut points. Hamilton’s theorem indicates the fixed 
point property for homeomorphisms for this continuum, while Theorem III gives no 
information. On the other hand, Hamilton’s theorem applies necessarily to 1-dimen- 
sional continua, while here there is no such restriction. 
























THE EXTENSION OF LINEAR FUNCTIONALS 


By A. E. Taytor 


1. Introduction. If E is a Banach space, its conjugate space E*, the space 
of all linear functionals defined on £, is also a Banach space. If MQ is a linear 
manifold in FZ, and ¢ is a linear functional on I, regarded as a normed linear 
space in itself, a well known theorem’ asserts that it is possible to extend ¢ 
to all of EF without increasing its norm. The extension thus obtained will not, 
in general, be unique. One of the considerations of this paper is the establish- 
ment of criteria for the uniqueness of such extensions in all cases—i.e., for all 
subspaces Yt of the given space. A sufficient condition is found to be that the 
unit sphere in E* be strictly convex; this condition is also necessary in case E 
is reflexive (for details see §4). 

Another question which arises is that of whether a rule of extension may be 
established which will be linear. That is, is it possible to define a linear opera- 
tion A on Y* to E* such that the linear functional f = Ag will be an extension 
of ¢, for each ¢g in M*? This will imply, as we show in §3, that M* is iso- 
morphic’ with a linear subspace in E*. This question is for the most part 
distinct from the question of uniqueness of extension and is discussed in §3 with 
the aid of the notion of a projection of a space on a subspace. §5 is devoted toa 
few remarks on a situation in which the uniqueness of extension and the exist- 
ence of the operation A, described above, are bound together. Finally, in §6 
an example is given to show the irredundancy of a part of the hypothesis of 
Theorem 3. 


2. Notation. Throughout the paper the following conventions in notation 
will be observed. E denotes a Banach space, E* its conjugate space, and E** 
the space conjugate to Z*. QM denotes a closed linear manifold in E, M* the 
space conjugate to 2% considered as a space by itself. We use letters z, y, --- 
for elements of E; f, g, --- for elements of E*; F, G, --- for elements of E**. 
Elements of I2* are denoted by ¢, and elements of I** by ®. 

For our purposes it is frequently convenient to introduce the following nota- 
tion for the values of various linear functionals. We write 


f(z) = (2, f), 
g(x) = [z, ¢], 
F(f) = tf, F}, 
(y) = (y, (@, 
















Received January 26, 1939. 
18. Banach, Opérations Linéaires, p. 55, Théoréme 2. We shall refer to this as the 
Hahn-Banach theorem. 

? For the definition of this term see the beginning of §3. 
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where the letters have the significance explained above. It should be kept in 
mind that (z, f) is a bilinear functional of the arguments z, f, and that || f || 
is the smallest constant C such that | (2, f)| S$ C||2|| for alla2e#. Similar 
remarks apply to the other expressions. 

We denote by T the linear operation on E* to Y* defined by the condition 


(2.1) [x, Tf] = (a, f), reM, f ¢ E*. 

Evidently || 7f || = ||f ||. But, by the Hahn-Banach theorem (see footnote 1), 

if g « M*, there exists an f « E* such that Tf = ¢ and ||f || = ||¢||. Hence 
T || = 1 and the range 7(£*) of T is precisely M*. 


3. The relationship of Y?* and E*. We first recall the notions of isomorphism 
and equivalence for normed linear spaces. £, and E; are said to be tsomorphic 
if there exists an operation A which carries F; into all of FE, in a one-to-one 
manner, and if further A and its inverse A are linear operations.* The spaces 
are said to be equivalent if in addition || Az || = || x || for each z in £,. 

In discussing the relationship of )t* and E* we shall assume that WM is a 
closed linear manifold. There is no loss of generality in so doing, for if M 
were not closed, we could consider its closure Jt, and the conjugate space (M)* 
would be equivalent to M*, as is easily seen. 

Finally, we shall define the notion of a projection of a space EZ. By a pro- 
jection of E is meant a linear operation P on E to E with the property P* = P. 
It is easily seen that the range of P is characterized by the fact that Pr = zx 
for elements of this range. Hence the range is a closed linear manifold. If 
the range is I, we say that P projects E on M. The conjugate operation P* 
on E* to E*, defined by the condition (z, P*f) = (Pz, f), x « E, f « E* is a pro- 
jection of E*, as may be verified at once. Its range is the class of f such that 


(x, f) = (Pz, f) for each z in E. 


TuEeoreM |. If P is a projection of E on M, there exists a linear operation A 
on IN* to E* with the following properties: 

(1) the range of A is the range P*(E*) of P*, and A sets up an isomorphism 
between IN* and P*(E*); 

(2) [z, g] = (2, Ag) if r eM, go e M*; 

(3) P* = AT; || A || = || P* ll = Il Pil. 

Proof. Define A by the condition (x, Ag) = [Pz, ¢], which condition is 
possible since Px ¢ Mt. Clearly A is linear, with || A || S || P ||. Since Pr = x 
in M, property (2) is evident. AT’ is an operation on E* to E* defined by 
(x, ATS) = [Px, Tf] = (Px, f) = (2, P*f). Therefore AT = P*. Since the 
range of 7' is exactly I*, we infer that the ranges of A and P* are identical. 
From (2) it follows that if Ag = 0, then g = 0. This means that A describes 
a one-to-one correspondence between Y* and P*(E*). Since both of these 





* By a linear operation we shall mean one which is distributive and continuous. 
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° ° ° ° 4 
sets are complete linear spaces, the inverse map is also continuous, and the 
mapping describes an isomorphism. Finally, from P* = AT we have || P* || s 
; an ae) 
|| Ajj. | 7, = ||A ||. Because of the relation’ || P* || = || P || we conclude 


the last part of (3). 


Corotiary. Jf || P || = 1, A sets up an equivalence between IN* and P*(E*), 

For then certainly | Ag || S ||¢||. But TAg = ¢ for all ¢ « M* because 
lz, TAg]) = (x, Ag) = [z, ¢] if reM. Therefore || ¢ || <= || Ag||, and so 
ie ii = || Ae || 


THEOREM 2. [f there exists a linear operation A on I0* to E* with the property 
(3.1) [z, ¢] = (x, Ag), reM, o e M*, 


the range A(M*) of A is a closed linear manifold in E*, and there exists a pro- 
jection Q of E* on A(M*). Furthermore, A sets up an isomorphism between IN* 
and the space A(IN*), and if A‘ denotes the inverse operation, then 


l 


(3.2) vo 


S ||Q|| = |All. 

Proof. We define Q = AT. Then Q is linear, and ||} Q|| S || A||. It is 
also immediate that the ranges of Q and A are identical, for the range of T' is 
exactly I2*. That Q is a projection follows from the fact that TAg = ¢ for all 
g € IN*; this is because [z, TAg] = (x, Ag) = [z, ¢] by the properties of A and T. 
Thus the common range of A and Q is a closed linear manifold in E*. Since 
TAg = ¢, we have |g) S || Ag||, and so Ag = 0 implies g = 0. As in the 
proof of the previous theorem this implies that A defines an isomorphism of Jt* 
and A(IN*). The relation A-'Q = T then leads to the inequality 1 = || T || 

A'|..|Q). This completes the proof. 


Corotiary. If A defines an equivalence of IN* and A(M*), || A || 
A™'|! = 1, and so ||Q|| = 1. 


lA 


Since a projection P of EF automatically induces a projection P* of E*, the 
question naturally arises as to whether the operation Q of Theorem 2, which 
is a projection of E*, may not induce a projection of EF on M. The natural 
way to investigate this matter is to observe that Q* is a projection of E**, and 
to proceed from this, using the well known fact that Z may be imbedded in E™. 
This method has been carried through in Theorem 3, below, with the aid of 
the additional assumption that 9 is reflexive. That this assumption is essen- 
tial in order to obtain the stated conclusions is shown by an example in §6. 

Derinition. A Banach space E£ is said to be reflexive if every element F 
of £** is representable in the form {f, F}| = (x, f) for some z in E and all fin E*. 


* Banach, op. cit., p. 41, Théoréme 5. 
* Banach, op. cit., p. 100, Théoréme 3. 
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When E is reflexive, the correspondence F <> x defines an equivalence’ of E 
and £**. It is further known that if £Z is reflexive, the same is true of all its 
closed linear subspaces.’ However, a non-reflexive space can have closed sub- 
spaces which are reflexive. For example, any finite dimensional subspace is 
reflexive. 

THEOREM 3. Ul/nder the assumptions of Theorem 2 and the further assumption 
that IM is reflexive there exists a projection P of E on M, with the properties: 

(1) P* = Q; 

(2) the operation A of Theorem 2 is given by (x, Ag) = (Px, ¢], and || A || = 
| P ll = |] Q |]. 

Proof. If we apply Theorem 1 to the space E* and the projection Q of this 
space, we infer, because of the fact that Q* is isomorphic’ with the range of Q, 
that P** is isomorphic with the range of the projection Q* of E**. Therefore, 
if we think of EF as being imbedded in E** and assume that 3 is reflexive, so 
that M and M?t** are equivalent, it is easy to see how Q* defines an operation 
on E to Mt. With this outline of the procedure in mind it is merely a matter 
of setting up the necessary notational machinery to carry out the details. 

First we define the imbedding of EF in E** by the operation V on E to E** 
given by 


\f, Vz} = (2,f), reE, f « E*- 


This defines a linear isometric (see footnote 6) (|| Vx || = || x ||) imbedding of £ 
in E**. V defines an equivalence of FE and E** if and only if £ is reflexive. 

Next, Q = AT, and so Q* = 7*A*, by the rules for forming conjugate opera- 
tions. Now A* is an operation on E** to M**, given by 


(vy, A*F) = [ Ag, F}. 


The range of A* is exactly IN**, for any functional & defined on Y* generates, 
by the operation A, a linear functional on the linear subset A(Q?*) in E*, and 
this latter functional has an extension to all of E*, by the Hahn-Banach theorem 
(see footnote 1). Thus A* plays the same réle with respect to E*, A*(M*) 


as 7’ plays with respect to ZF, M. The equation Q* = 7*A* now shows that 
the ranges of Q* and 7™* are identical. But more is true. 7™* sets up a one-to- 
one correspondence between IN** and the range of Q*, and therefore Y** and 
Q*(£**) are isomorphic. To see this we have only to prove that 7*& = 0 
implies @ = 0. Now 
(if, T*b} = (TY, ®), 
° For it follows from Banach, op. cit., p. 55, Théoréme 3, that sup | f(r) | = || 2 

(for all || f || = 1), so that || F || = || 2 ||. Even if E is not reflexive, this defines a 


linear imbedding of F in E**. 
™B. J. Pettis, Bulletin of the American Mathematical Society, vol. 44(1938), p. 423, 
Theorem 3. 


* The conjugate spaces of isomorphic spaces are isomorphic: see Banach, op. cit., p. 188. 
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and as f runs over E*, 7f runs over all of t*, and the above assertion is verified. 
Note that A*7*@ = @ for all in M**. For 


(y, A*T*&) = {Ay, T*®} = (TAg, 9), 


and we saw in the proof of Theorem 2 that 7’'Ag = ¢. 

Finally, if 2 is reflexive, there is a unique operation U on M** to M such 
that (yg, 6) = [U%, g]. We now define the operation P on E to M by the 
equation P = UA*V. It is evidently linear. It is a projection, for 


lf, VU®} = (U4, f) = [US, Tf] = (Tf, ®) = tf, T*}, 


that is, VU = 7*, and therefore P? = UA*T*A*V = UA*V = P, since A*T* 
is the identity transformation of J**. Now, if risin E, [Pz, gy] = [UA*Vz, ¢] = 
(yg, A*Vx) = {Ag, Vz} = (xz, Ag). But if z is in M, (xz, Ag) = [z, ¢], and 
therefore Px = x. The range of P is therefore exactly 2, and we conclude 
from Theorem 2 that || P || = || A ||. But also, 


(x, P*f) = (Px, f) = (Pz, Tf] = (x, ATf) = (z, Q) 


by the above reckoning. Thus P* = Q and the proof is complete. 

Remark. The assumptions of Theorem 2 are not fulfilled for all the closed 
linear manifolds in all Banach spaces, or even in all reflexive spaces. For it 
can happen that 92 is such that there is no projection of FE on M (see §6). 

As an instance of the situation set forth in Theorem 2 we note that if we 
regard E as being imbedded in E**, and E* as imbedded (in the same manner) 
in E*** then there is a projection, of unit norm, of E*** on the map of E* in E***. 
Looked at in another way, this amounts to saying that the identity operation of 
E* on itself admits a linear extension, as an operation on E*** to E*, without 
increase of norm. That the like statement is in general not true of E** and E 
is shown by the example in §6. There it is shown that if a projection of (m) 
on (¢9) exists, it is of norm 2 2. Since (m) is equivalent to (co)**, this furnishes 
us with a counter example in the present situation. 
































4. Unique extension of linear functionals. In this section the assumption 
that E is a complete space is unnecessary. 

Derinition. The space £ is said to have property A if, It being an arbitrary 
closed linear manifold in E, not E itself, and ¢ an arbitrary element of 2, 
there exists a unique element f of E* such that Tf = ¢, || f || = || ¢||, where 7 
is the operation defined in §2. (It suffices to consider M@ # E, for if M = EF 
the above requirement is automatically satisfied.) 

A closely related property of a space E is described in the following definition. 

Derinition. The space E is said to have property B if for each element 
zo © Oin E there is a unique element f of E* such that || f || = 1, f(vo) = || zo |}. 
The Hahn-Banach theorem (see footnote 1) guarantees the existence of at 
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least one f to meet the conditions laid down in the definitions. The uniqueness 
is the essential thing. 
It is clear that if E has property A, it also has property B. 


TureorEeM 4. If E has property B and if every closed linear manifold M, M ¥ E, 
in E is reflexive (as, for example, when E itself is reflexive), then E also has 
property A. 

Proof. First note that if Mt is reflexive and ¢ is in M*, then, for some 2 
in Mt, || zo || = 1 and g(a) = || 2o||. For there exists’ & « M**, || @|| = 1 
and @(y) = |\¢|)|. If M is reflexive, however, we can write ®(¢) in the form 
g(x) with x eM, || x|| = || e]||. Now suppose that fi, fee E*, with fi ¥ fe, 
Wfill = fell = lle|| and 7fi = Tf, = yg. Define gm = (1/|\ ¢ (Pf, ge = 
(1/\|¢ |)fe. Then || gi || = || ge|| = 1, and gi(to) = ge(ro) = 1 = || xo ||, 
but g:1 * gz, contrary to property B. 

Property B may be interpreted geometrically as follows: a set of elements S 
in E is called a hyperplane if and only if it consists of the totality of elements z 
satisfying an equation of the form f(z) = c. Two points not on a hyperplane S 
are said to be on the same side of it if the line segment joining them does not 
meet S. A hyperplane S is said to support the unit sphere || z || = 1 if and 
only if the distance from S to the unit sphere is zero, and all points for which 
|x || S 1, not on S, are on the same side of S. It may then be shown that 
a hyperplane supports the unit sphere if and only if it is given by f(x) = 1, 
where || f|| = 1. Through each point of the unit sphere there passes at least 
one supporting hyperplane. If there is at most one, we shall call it a tangent 
hyperplane. Property B is then the property that the unit sphere of EZ admits, 
at each point, a tangent hyperplane.” 

Another geometrical property of importance in the present connection is 
that of the strict convexity of the unit sphere. 

DeFINITION. The unit sphere in E is said to be strictly convex provided that 
z\|| = 1, ||y|| = 1,2 ¥ y, 0 < t < 1 imply that || & + (1 — dy || < 1. 

This means that the line segment joining two points on the surface of the 
sphere lies, with the exception of its end points, entirely inside the sphere. 
Every sphere in E will then be strictly convex also. 


THrorem 5. If E* has property B, the unit sphere in E is strictly convex. 
The proof depends on the following lemma. 


* See the reference in footnote 6. Instead of assuming that every WM is reflexive it suffices 
to assume that the unit sphere || z || = 1 in each subspace Mt is weakly compact. For if 
|an || = land ¢(z,) > |! ¢ || , we can then find zo e M such that ¢(z,) — ¢(xo) = || ¢ |! . 
It follows that || zo || 2 1; since a convex, closed set is weakly closed, we conclude |! x9 
=1. The proof is then as before. 

A general discussion of convex hypersurfaces and their supporting hyperplanes in 
normed linear spaces is given by 8. Mazur, Studia Mathematica, vol. 4(1933), pp. 70-84. 
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LemMMA. If the unit sphere in E is not strictly convex, there exists a line segment 
tx + (1 — dy, 0 St S 1, all of whose points lie on the surface of the sphere; i.¢., 

tx + (1 — éy || = 1. 

Proof. There exist distinct collinear points 2, 22, 23, with || 2; || = 1 
Suppose that z2 lies between 2, and z;. Unless all points of the segment 
(x, , 23) are of norm one (which is what has to be proved), there exist y between 
x; and ze and z between ze and 23, with either || y = 3. is < 1, o 

y|| <1, ||z|| $1. In either case z. = ty + (1 — t)z,0 < t <1, || ae/|| s 
t\l\y!! + (1 — 2) |\2!! < 1, a contradiction.” 

We now return to Theorem 5. If it is false, we may suppose that 2; , 22 and 
the line segment joining them lie on the surface of the unit sphere. That is, 

fx, + (1 — tre|| = 1,0 St 1. Then || tx, + (1 — é)xe|| 2 1 for all 
real ¢. We can choose fy ¢« E* so that || fo || = 1, folai) = 1 (@@ = 1, 2). For 
the condition that such an fy exist is that” 

ja+b| S || ax, + bre 


for all numbers a, b. If we let a + b = r, the above is trivially true 


when r = 0. If r # QO, ax; + baxe om ax, + (r — 4@)ze2 = 

r (a/r)a, + (1 — a/r)ze|| 2 |r| = | a+b), by the properties of x, 2. 
If now we define F; « E** by Fi(f) = f(zi), we have || F; || = 1 and Fy(fo) = 
1= fi. But PF; + Fy if 7, # ze, and so E* cannot have property B. 


THEeorEM 6. If the unit sphere in E* is strictly convex, E has property A. 
If E is reflexive, the converse is also true. 

Proof. Let M #¥ E, ¢ be given, and suppose that f, g « E*, || f ||) = || g\, = 

¢ ,7f=Tg=¢. Then T(tf + (1 — dg) = te + (1 — Ye = gand hence 

¢\/s Uu+1—tg)\\. Butif0O<t<landf ¥g,|\t{+ (U1 —bg|| <-¢, 
since the sphere of radius || ¢ || is also strictly convex. Hence we must con- 
clude f = g. 

This is the direct part of the theorem. If £ is reflexive, and if it has property 
A, then E** also has property A (since E and E£** are equivalent). #** there- 
fore has property B, and this implies, by Theorem 5, that the unit sphere in £* 
is strictly convex. 

Tueorem 7. If the unit sphere in E is strictly convex, and if E is reflexwe, 
E* has property A (which is the same here as property B, since E* is also reflexive). 

Proof. Because of reflexivity it follows that the unit sphere in #** is strictly 
convex. The conclusion then follows from Theorem 6. 

It is natural to ask the following questions. Does property A, or the striet 
convexity of the unit sphere in KE, imply the corresponding property for £” 
Are these properties dependent upon each other; that is, does either of them, 
possibly in conjunction with reflexivity, imply the other for the same space F? 







'! This proof of the lemma was suggested by the referee, as was also Theorem 7, below 





'? Banach, op. cit., p. 57, Théoréme 5. 
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The answers are in the negative, as may be shown by examples. To illustrate 
we consider a two-dimensional space of elements x = (x; , 2), where the num- 
bers 2; , 22 are the Cartesian coérdinates of the end point of the vector z. The 
conjugate space E* is also two-dimensional, and its elements may be represented 
on the same coérdinate system that is used for E. The “unit sphere” in each 
case is defined by a convex curve symmetric about the origin. By an examina- 
tion of the lines of support for the curve for FE one can determine, using the 
geometrical interpretation of property B, the nature of the curve for E*. In 
doing this it is found that a corner on the curve for E (i.e., a place where the 
supporting line is not unique) leads to a flat portion of the curve for E*. An 
example in which the curve for £ is strictly convex, but with corners, while the 
curve for E* is not strictly convex, and is without corners, is the following. We 
give the definitions of those parts of the curves in the first quadrant only. 


BE: (x1 + 4)° + (a2 + 3)° = §, 
% = 1, 0& am & 3. 
E*: O(xi + 23) — 2x2 — 4(x, + y-4=0(2 2% 8! 
= os A a re re ee i) (2 Te - <n <1 . 
“a = 1, 0 S 2x S }. 


Thus E is reflexive (because finite dimensional), its unit sphere is strictly con- 
vex, but it does not have property A; E* is reflexive, has property A, but its 
unit sphere is not strictly convex. Since Z** = E, this example shows that the 
answers to the above questions are in the negative. 

We shall conclude this section with a few remarks about the converse part 
of Theorem 6. In the first place, note that if # has property A, so do all of its 
linear subspaces. Now suppose that the unit sphere in E* is not strictly convex. 
Then evidently there is a two-dimensional linear subspace S of #* whose unit 
sphere is likewise not strictly convex. If S were the conjugate space of some 
linear subspace in FE, this subspace could not have property A, for S is reflexive. 
Therefore E itself could not have property A. It seems doubtful, however, 
that S need be the conjugate of any subspace of EF. 


5. Further questions. In this section we shall consider briefly the following 
question: Is there any connection between the property A for a space EF and the 
property that for every closed linear manifold Yin # the hypothesis of Theorem 2 
(or Theorem 2, corollary) is valid? Sinee, by Theorem 1, the hypothesis of 
Theorem 2 is valid if there exists a projection of FE on Yt, and since such a 


projection will always exist, for every W, if & is finite dimensional, we see that 
this does not imply anything about property A, which may or may not be 
present (see example in §4). The projection may even be of norm | and yet 
property A be missing. This happens, for example, if F is a two-dimensional 
space, with « = (x, %2), ||} 2 || = |a.| + | ae}, for here the unit sphere has 
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corners.” On the other hand, property A is enjoyed by the reflexive spaces 
L’, l’, (p > 1), and yet these contain closed linear manifolds on which there 
exists no projection." Thus by Theorem 3 the assumptions of Theorem 2 can- 
not hold for all closed linear manifolds in L’, 1’. 

In order to see more precisely what can be said let us, for a fixed I, consider 
the operation 7 defined in §2. Denote by § the set of elements f « E* for 
which || Tf || = ||f ||. § is not empty, and if M contains elements other than 
the zero element, so does §, by the Hahn-Banach theorem (see footnote 1). 
Moreover, 7(§) = M*, and if f « §, af « § also, for any constant a. Now if 
there is a unique way of extending a linear functional from J to EF without 
increasing its norm, the totality of elements of Z* generated by this extension 
process is seen to be exactly §. Consequently, if in addition to the uniqueness 
of extension we suppose that the operation A of Theorem 2 exists, with the 
property || Ag || = || ¢ ||, we conclude that A(M*) = §F, and hence that F isa 
closed linear manifold in E*. Conversely, if § is a closed linear manifold in E*, 
we have 

TuHeoreM 8. [If the set § is a closed linear manifold in E*, the following things 
are true: 

(1) T defines an equivalence of § and M*; 

(2) each linear functional ¢ on M has a unique extension of norm || ¢ ||; 

(3) there exists a projection, of norm 1, of E* on §. 


Proof. (3) is a consequence of (1), by Theorem 2, corollary. (2) is likewise 
a consequence of (1). (1) itself is immediate if we note that 7f = Tg implies 
0= || Tf — Tg}| = || TY —g) || = || f —g ||, whence f = g. Hence 7 maps 
on M* in a linear, one-to-one, isometric fashion. 

The hypothesis of Theorem 8 is satisfied, for arbitrary 2, if # is an n-di- 
mensional Euclidean space, a Hilbert space, or any space whose norm is derived 
from a positive definite bilinear functional. One might suspect that this is 
a characteristic property of such spaces. 


6. Examples. We have already quoted the fact (see footnote 14) that the 
reflexive spaces l’, L”, (p > 1) contain closed linear subspaces on which there 
exists no projection. Since the closed subspaces of a reflexive space are also 
reflexive, it follows, by Theorem 3, that a subspace of L” (or 1”) on which there 
exists no projection cannot satisfy the conditions imposed on the linear mani- 
fold M in Theorem 2. 

We next offer an example to show the need for the assumption, in Theorem 3, 


18 Any two-dimensional normed linear space always admits projections of norm 1 on an 
arbitrary one-dimensional subspace. See Bohnenblust, Annals of Mathematics, vol. 
39(1938), p. 302, Theorem 1. 

14 L,? and | (p > 1) have property B (see p. 79 of the reference to S. Mazur in footnote 10). 
Since they are reflexive they also have property A. For the assertion about linear mani- 
folds with no projections see F. J. Murray, Transactions of the American Mathematical 
Society, vol. 41(1937), p. 138. 
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that Mis a reflexive subspace. Our example, in which M satisfies all the condi- 
tions of the theorem except that of reflexivity, depends on the fact that for the 


operation A which we consider, || A || = 1, while any projection of E on M 
must necessarily have norm = 2. Theorem 3 asserts the existence of a pro- 
jection P of norm || P || = || A ||. It would be desirable to give an example 


in which there is no projection of Hon QM. It is an open question whether this is 
true or not in the example given. 


For It we choose the space (co) of sequences x = (2, 22, ---) converging to 
zero, With || x || = max|2z;|. A linear functional on (co) has the form 
ry 
0 
(6.1) ¢(x) _ Do ax, 
i=l 
where 


ell =D lal < @. 


Thus Q* is equivalent to the space (l) of absolutely convergent series. 

As E we take the space (m) of bounded sequences, with || z || = sup | 2; |. 
Then 2 is a closed subspace of EZ. The operation A of Theorem 2 obviously 
exists. We need only define Ag = f, where f(z) is also given by (6.1), for all 
te(m). Then|| Ag|| = ||¢|| and || A || = 1. Tosee that a projection of (m) 
on (co) (if such a projection exists) must be of norm not less than 2, it suffices to 
consider projections of (c) on (co), where (c), the space of convergent sequences, 
is a subspace of (m), but contains (co) as a subspace. 

Now a linear functional on (c) has the form 


eo 
v(x) = >» a;x; + 6 lim zy. 

i=l n—2 
Therefore a projection of P of (c) on (co) must be defined by a sequence of 
functionals Pr = {y,(x)} of the form 

Wi(x) = 2; + 5; lim 2, 
no 
where, since of necessity | ¥:(x) | — 0 for each z, we must have b; > —1. More- 
over, || ¥s|| = 1 +]; |.”° Now choose i so that || yi || = 1 + |b: | > 2 —- ¢; 
next choose 2, || z || = 1 so that | ¥:(x) | > || ¥i || — « > 2 — 2e. Thus|| Pr|| 
= max | ¥x(x) | > 2 — 2e, and || P || = 2, as we wished to prove. 
x 


UNIVERSITY OF CALIFORNIA AT Los ANGELES. 


16 Banach, op. cit., p. 66-67. 








CERTAIN CONGRUENCES INVOLVING THE BERNOULLI 
NUMBERS 


By H. 8S. VANDIVER 


° 1 ° ° ° 

In a previous paper I gave a general theorem concerning congruences in rings 

which yields congruences involving Bernoulli numbers, in particular, the relation 
known as Kummer’s Congruence: 


(1) h"(h?-* — 1)’ = 0 (mod p’) 


forn — 1 = j,n # 0 (mod p — 1), where p is an odd prime, the left member is 
expanded in full by the binomial theorem and then 6,/t is substituted for h’. 
The b’s are defined by the recursion formula 


(6+ 1)" = bd, (n > 1), 


where we expand the left member by the binomial theorem and substitute 5; 
for b*. The b’s give the Bernoulli numbers. 

A number of proofs of (1) have been given, all, as far as I know, including the 
restriction n # 0 (mod p — 1); in fact, a simple inspection shows that the result 
does not hold when n = 0. The question naturally arises whether there is some 
complementary theorem which provides for the case n = 0 (mod p — 1). 
Nielsen® investigated this problem and found the relation 


1 a s=r ili r 
l- _* > (—1) (") B,s (mod p), 


B, = (—1)""ban; wp = 3(p — 1). 
To obtain this he sets 
S,(m) = 1" + 2" + .-- + (m — 1)" 


and employs the following relation (proved by him previously), 


p—1 


k=r 
> (— 1)' (;) Sons2ku(p) = ym s**(1 —_ S*)’, 


and 
Som(p) = (—1)""Bnp (mod p’) (p > 3, m > 1). 


The result follows. Now it is not clear how this argument may be extended to 
the examination of (2), modulo p*, in lieu of modulo p. But the method of my 


Received January 30, 1939. 

1 Bull. Amer. Math. Soc., vol. 43(1937), pp. 417-423. 

2 Journal fiir die Mathematik, vol. 41(1851), pp. 368-372. 
3 Traité des Nombres de Bernoulli, pp. 277-278. 
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paper referred to in footnote 1 for the derivative of the relation given at the top 
of page 422 of that paper yields an extension, after certain devices are intro- 
duced. I shall confine myself to the proof of the 


THEOREM. If pis an odd prime, then 
(2) BrP? (bP * — 1)’ 


where the left member is expanded in full, by is substituted for b*, and a > 0,7 > 0, 
at+tj<p-l. 


2 . . 4 
For proof we take the relation 


0 (mod p’’), 


— S; k pk-1 i i 8 : 
(n 1) (p) = ps by «'Cus(") pr ) 
Pp a=] s=1 a 
where ” is prime to p and 
Ya = — (mod n) (0S ya <n); 


as well as the known congruence 
i i i : 2k 
1+ 2' +... + (p* — 1)' = p'd; (mod p”) 
fori even and p > 3, the latter condition holding in the theorem since a +7 < 


p—1. These give 


a 5 peel 2 , 
(n 1 )bei = yi yaa” * (mod p*). 


a=1 


2 


In the main theorem of my previous paper set s = 2, 


pi-l 


Sng = 1, 81 = 1, Be = —1, and we obtain 
(3) t(t?' — 1)’ = 0 (mod p’), 


where 


and ¢; is substituted for ¢' after the complete expansion of (3). Fore not divisible 
by (p — 1) we may select n to be a primitive root of p and in addition such that 


n”" = 1 (mod p’). Hence 
norer-) _ 1 = n° — 1 (mod p’), 
‘Given without reference in my other paper (see footnote 1). It may be obtained 


from the relation (5) in the writer’s paper in the Annals of Math., vol. 18(1917), p. 111, 
by setting m = pt and S = 1‘ + 2' +--+ + (p* — 1)%. 
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and after dividing by n° — 1 (# 0 (mod p)), (3) reduces to (1). However, for 
ca multiple of (p — 1) we note the known result that 

xz” — (1 + kp) = 0 (mod p’) 
always has a solution prime to p for any integer k, and set 

ne’ = 1 + kp (mod p’); 
then 
nu” —1] r 
: = rk+ (5) kp +--+ +k’ p"" (mod p’’). 

Now insert this value in (3), after dividing through by p, and then divide by k. 


We find that 
nu?» ‘eats 1 


pr(p — 1) 
has been replaced by 
1 l r 1 ei ri] 
+ > ( ) k + aia + k ’ 
p—1 r(p—-1)\2 P r(p — 1) 


modulo p’'. Collecting the powers of k in (3) thus obtained, we have 


(4) Ao + Arpk + --- + A,ak™"p™ = 0 (mod p”"), 

where 

(5) Ay = b°(b”* — 1)’. 

If we set k = 1, 2, ---, rin turn, we obtain, since r — 1 < p — 1, aset ofr 


congruences, and the determinant of the coefficients of the expressions A,p’ 
is an alternant consisting of products of the form (¢ — 7) (¢ < r,j7 <r), where 
i # j, and each of these is, in absolute value, < p. Hence 


(6) Ao = pA, = --- = p’ ‘A, = 0 (mod p’ '), 


and with (5) this gives our theorem since c = 0 (mod p — 1). 

It is obvious froth (6) that we have also a set of congruences moduli p””, 
p’*,--+,p’” in turn. 

At first glance it may appear that (2) is less general than (1) because, at least, 
the modulus is p’ ' instead of p’. But all the denominators of fractions occurring 
in (2) are divisible by p, so that if we multiply (2) through by p, we obtain by 
the von Staudt-Clausen theorem an expression on the left involving only frac- 
tions whose denominators are prime to p and the modulus is p’, and this is 
analogous to (1). 

The theorem was subject to the restriction a > 0. A modification of the above 
method will yield a theorem for a = 0. 

It is possible to extend these ideas so that we may set up a congruence of the 
type given at the top of p. 422 of my previous paper with some of the n’s zero, 





for 
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and if n; = 0, we substitute 6, for A; in the result and in lieu of p’ in the modular 
system on the right we reduce the exponent j by one for each n; that is zero. 
However, in a recent paper’ the Bernoulli, Euler and Genocchi numbers have 
been generalized and some congruences of the type just mentioned may be shown 
to hold involving these generalized numbers; so I shall not attempt to give the 
most general form for our theorem until we can see how far these ideas as to the 
generalized numbers extend. 


UNIVERSITY OF TEXAS. 


’ Vandiver, Proc. Natl. Acad., vol. 23(1937), pp. 555-559. 








THE INVARIANT THEORY OF THE TERNARY TRILINEAR FORM 
By JosePHINE H. CHANLER 


1. Introduction. Let z, y, z be three digredient ternary variables represented 


by points x, y, z in three respective planes Z,, F,, E.. We consider the 
3 


trilinear form F(z, y, z) = (ax)(By)(yz) = >> anijtnyz;, where the ay;; are 


h,i,j=1 
arbitrary complex numbers, and where we suppose that the form can be ex- 
pressed in no fewer variables. The form has 27 coefficients, or 26 projective 
constants, of which we may eliminate 3-8 by proper projective transformations 
in E,,#,,E,. Fora form thus involving only 2 absolute projective constants, 
a complete discussion and classification of types may be anticipated. 

In a joint article by R. M. Thrall and the author [12]' the classification of 
such forms was given under (i) non-singular linear transformations on the sets 
of variables taken separately, and (ii) interchanges of the sets of variables. 
Two forms equivalent under (i) were called equivalent; if equivalent under (i) 
and (ii), they were called generally or g-equivalent. In a previous article [11] 
Thrall classified analogous forms in a GF(p). For other references to recent 
studies involving the group-theoretic point of view [12] and [11] may be con- 
sulted. In [12] the approach was geometric; we studied the cubies X(x) = 0, 
Y(y) = 0, Z(z) = 0 obtained by equating to zero the determinants | M,(z) |, 

M,(y) , |_M.(z) |, where, e.g., M,(x) is the matrix (> a,ij;%). Particular 
h 


attention was paid to the cases where the cubics had singular points or de- 
generated; for all such cases canonical forms for F were obtained. While the 
case of the elliptic cubic was discussed briefly from the analytic viewpoint, 
the method of attack made it impractical to set up a canonical form. In the 
present paper such a form is determined algebraically (see §4). 

A promising method of investigation is the study of the concomitants of F. 
While certain of the concomitants have been previously studied, they have 
appeared rather in connection with bilinear than with trilinear forms. For 
example, C. Jordan [4] investigates the reducibility of the linear system of 
bilinear forms given by the trilinear form Tin. = D> dasyabtsty - For 1 = 
m= Nn 3, he bases his discussion of the form 7',,, on the position of \ with 
respect to the cubie in A given by the discriminant of the net of bilinear forms 
in uw, z. He does not mention the fact that this cubie is only one of three 
corresponding to our X(x) = 0, ete., which play equally important rdéles so far 
as Tim, is concerned. On the other hand, when J. Rosanes [8] studies the 3 


Received January 30, 1939. 
'‘ Numbers in brackets refer to the bibliography at the end of the paper 
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ternary bilinear forms f(z, y), f'(z, y), f°(z, y), he gets two cubics, D(z*) = 0, 
A(y’) = 0, which are the locus of the «' common null pairs of the 3 bilinear 
forms. In a later article [9] Rosanes obtains a determinant invariant D whose 
vanishing is the condition that the 3 bilinear forms can each be expressed as a 
linear combination of the same 4 special (factorable) forms. He also proves 
that the vanishing of D insures that the 3 forms can be sent by a linear trans- 
formation upon one set of variables into 3 symmetric forms. When B. Igel [3] 
studies the 3 ternary bilinear forms, he considers D(z*) = 0, A(y’) = 0 as together 
providing an analogue to the Jacobian curve of 3 quadratic forms. Maennchen 
[5] proves the equivalence (except for sign) of D with 2 other invariants of the 
net. The fact that it is an invariant of the trilinear form is used for the first 
time by Pasch [7] in proving the irreducibility of D. When D is so recognized, 
it follows readily from Rosanes’ results that when D = 0, one can send F = 
(ax)(By)(yz) into the polarized form of a ternary cubic by proper projective 
transformations in the three planes. This theorem I have not found in the 
literature. 

In §2 of the present article we set up the complete system of integral con- 
comitants of degrees 1, 2, and 3 for the form F. In §3 we give their geometric 
interpretation where possible; the curves X(x) = 0, Y(y) = 0, Z(z) = O are 
related more closely to the nets of quadratic transformations by means of the 
concomitants. In §5 important concomitants for the canonical form derived 
in §4 are determined, including the invariant D above mentioned. Two ab- 
solute constants for the form are obtained. In §6 we consider cases where the 
transformations set up between the cubics X(x) = 0, Y(y) = 0, Z(z) = O are 
periodic. The invariant conditions for the early cases are found, and interesting 
connections are made with certain points on the cubic, such as the sextactic 
points and the vertices of the in-and-escribed Hart triangles. 


2. Concomitants of degrees |, 2, 3 for the form / = (ax)(8y)(yz). We think 
of F as undergoing linear transformations on the sets of variables taken sepa- 
rately. Hence the concomitants, when symbolically expressed, will contain only 
factors of the types 


(gx), (ny), (gz), (ax), (By), (yz), (care), 


(B:B2n), (viv2h), (aya2as), (818283), (yrv27s), 


where subscripts are used for degrees greater than 1. 
For degrees 1 and 2 we have the following, besides combinations with (éz), 


(ny), (fz): 


Degree 1: F = (ax)(By)(yz) with 27 coefficients. 
Degree 2: F’, Qa = (BiB) (yryot) (air) (eer), 


Qs = (crook) (yrvet)(Biy) (Bey), Qy = (arerré)(8i82n) (12) (y22). 
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To check the completeness for degree 2 we note that F’ has 6-6-6 = 216 coeffi- 
cients, while each Q has 3-3-6 = 54. Thus we have a total of 378 which is 
the number of linearly independent quadratic combinations of the 27 coeffi- 
cients of F. 

For degree 3 we give the possible forms explicitly because of the syzygies to 
follow. Besides combinations with the absolute invariants (tx), (ny), ({z) we 
have: 


F* = (aux) (aor) (ax) (Bry) (Boy) (Bay) (v12) (v22) (y32), 
Qa-F = (B:B2n) (yrv2f) (arr) (ar) (asx) (Bsy) (732), 
Qs-F = (aycroé) (yrvet) (Bry) (Bey) (asx) (Bsy) (ys2), 
Q,-F = (ascot) (81829) (y12) (22) (asx) (Bay) (v2), 

Ba = (818283) (yrvzvs) (aX) (ar) (asx), 

Bs = (vrv27s) (arc203) (By) (Bay) (Bsy), 

B, = (acres) (818283) (12) (22) (32), 

Cas = (ejcr2@3) (81820) (Bsy) (112) (22) (722), 
Ca = (BiB28s) (aero) (axgx) (y12) (22) (v2), 
Cay = (arcrgcrs) (yrv2t) (ys) (Bry) (Bay) (Bay), 
Cya = (revs) (ara) (asx) (Bry) (Bay) (Bay), 
Cay = (8188s) (vrvef) (ysz) (air) (ar) (asx), 
C13 = (yrvzvs)(8:B2n) (Bsy) (ax) (a2r) (asx), 

Da = (cxyergers) (yrv2) (81830) (Boy) (ys2), 

Dg = (818283) (care) (yrvst) (y22) (asz), 

D, = (yrvzvs) (8:82) (rast) (aor) (Bay), 

E. = (aycroé) (81830) (yrvst) (asx) (Bay) (y22), 

Es = (B:B2n) (vrvst) (arasé) (Bsy) (v2) (a2z), 

EB, = (vrvet) (arersé) (8183) (vs2) (aor) (Bay), 

Ga = (B:Ben) (yrvst) (By) (v22) (a2) (ar) (asx), 

Gg = (vrvef)(arcrsé) (ysz) (aar) (By) (Bey) (Bay), 

G, = (ayant) (81837) (asx) (Bey) (112) (v22) (y32), 

H = (eycroé) (81839) (veya) (asx) (Bey) (112). 


These forms are related by the following syzygies: 

H + E, — E, + Ds-(ny) = 9, H + Es — E. + D,- (2) 
H+ E, — Es + D.-(x) = 0, 
3C.s — B,-(ny) = 0, 3Cza — B,-(&z) = 0, 3C a, — Bg- (fz) 
3C ya — Bg-(&z) = 0, 3Cs, — Ba-(gz) = 0, 3Cys — Ba-(ny) = %, 
6G. — 3Qa-F + Ba-(ny)-(t2) = 0, 643 — 3Qs-F + Bz-(&z)- (fz) = 
6G, — 3Q,-F + B,-(&z)-(ny) = 0. 

Hence we may represent all concomitants of degree 3 as linear combinations of 


(1) F. 0..F, & F, OF, B, Be, B.. Be, Be: De Be- 
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The number of coefficients for each of these is respectively 1000, 810, 810, 810, 
10, 10, 10, 81, 81, 81, 729. The total is 4432 whereas the number of linearly 
independent cubic combinations of the coefficients of F is Cos:3,3 = 3654. How- 
ever, the apparent discrepancy is due to the fact that we have not chosen 
normal forms for our fundamental system. If we make use of a theorem of 
Study’s ({10], p. 55) concerning the expansion of a connex (BX)"(UP)" in a 
series of powers of (UX), we can prove that the number of linearly independent 
coefficients in our system (1) is precisely 3654. Thus (1) constitutes a complete 
system of concomitants of degree 3 and no further syzygies occur for that 
degree. The great difficulty in obtaining normal forms is due to the lack of a 
unique expansion for a form in more than one pair of contragredient variables 


({10], p. 83). 


3. Geometric interpretations. For given r = 2%, 
(1) (ax) (By)(yz) = 0 


is the equation of a correlation between the planes E, and E,. If to points 
yo, y: in EZ, there correspond respectively the lines in E, , 


(S0z) = (axo)(Byo)(yz) = 0, = (S12) = (ato)(By:)(yz) = 9, 
then for any arbitrary line £ on fof, , 
(Sos1S) = 3(vrv2k) (BiBeyoy:) (aX) (aero) = 0. 
Putting » = yoy: in E,, we have for x = 2 
(2) Qa = (B:B2n) (vive) (ait) (agr) = 0 


as the dual form of the correlation (1). If y, z are in turn taken as given con- 
stants, (1) represents in turn correlations between E, and E, and between 
E, and E, for which the dual forms are given by Qs and Q, , respectively. 

If = zp in (1) and (2), and if yeisa third point in EZ, corresponding to (fz) = 0 
in B, , the 3 lines ({oz) = 0, (f:z) = 0, (fz) = O are concurrent if 

(FoFis2) = §(Youry2) - (818283) (yrv27vs) (arto) (a2%o)(asxo) = 0. 

Thus ordinarily the 3 lines ¢ are concurrent only if the 3 points y are collinear. 
If, however, 2 is on the cubic in Z£, given by 


(az)Buyu = (mr)Buy2 (ar) Buyis 
(3) X(x) = $Ba =  (aax)Bayyn = (a2r)Bovye2 (ar) Boxy2s = 0, 
(ast) Bssys1 (ast) Bssys2 (vst) Baxyas 


the three lines are on a point zo, whatever be yz ; that is, (ax)(By)(yz) = 0 
iny. We say that zo is the singular point of the correlation in E, , and because 
of the symmetry there is also a singular point yo in E,. The correlations 
(ax)(Byo)(yz) = 0 and (ax)(By)(y2o) = 0 are also singular with x as a singular 
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point. ‘The corresponding cubies Y(y) = 0, Z(z) = 0 are given by By and B, ; 
and we have 

THEeorEeM i. The cubics X(x) = 0, Y(y) = 0, Z(z) = O given by B. , Bs , B,, 
respectively, are such that each gives in its plane the locus of points for which the 
correlation (1) is singular. 

If x = x is on X(x) = O, (2) becomes the product of the singular points in 


E, and E, , or 


(2a) (8:82) (vrv2e) (ato) (a2t0) = p( myo) - (FZ). 


Thus a transformation ,7', is set up between X(x) = 0 and Y(y) = 0, by which’ 


yo on Y(y) = O corresponds to x on X(x) = O if yo is the singular point in £, 
of the singular correlation determined by x. Also a transformation ,7’, is set 
up between X(x) = 0 and Z(z) = 0, by which z on Z(z) = 0 corresponds to 
on X(x) = 0 if 2 is the singular point in EZ, of the singular correlation deter- 
mined by 2». Analogous transformations ,7,, ,7', and .7,, .7’, are given by 
forms derived from Qs, , Q, as (2a) was derived from Q,. By the remark pre- 
ceding Theorem 1, if 2, yo correspond under ,7',, they also correspond 
under ,7', . 
If in (2) we fix » = m, 


(2b) Qez = (B:B2m0) (viet) (az) (eer) = 0 


gives us a quadratic transformation carrying £, into Z, in such a way that 
X(x) = 0 goes into Z(z) = Oas under,7,. The 3 direct F-points on X(x) = 0 
correspond to the 3 points yo in which 4 meets Y(y) = 0. If in (2) we fix 
¢ _ So ’ 


(2c) Qe, = (B:B2n) (yrvefo) (ax) (aor) = 0 


gives us a quadratic transformation carrying EZ, into EF, so that X(x) = 0 goes 
into Y(y) = 0 as under ,7’, and so that the 3 F-points on X(z) = 0 correspond 
to the points z in which {> meets Z(z) = 0. Hence (2) gives us a system of 
conics Q(n, ¢) which cut out two g}’s coresidual to each other in the g§ cut out 
by all conics on X(xz) = 0. Thus, for fixed » and variable ¢ we have a system 
of conics on a fixed triad g3(n) which cut out variable triads g2(¢). A triad 
g2(n) and the corresponding triad of fundamental F-points in E, constitute in 
Rosanes’ sense a pair of polar triangles common to one pencil of the net of 
bilinear forms in z, z given by (1). By bordering the determinant of X(z) 
in (3) with ’s and ¢’s, we get the system Q(n, ¢) in the form 


(az)Buyun (ar)Buy2 (air)Buyis =m 
(agr)Boyyn (aet)Bavye. (a2r)Boryes ne 
( = —19, = = ( 
4) Ong) 20a (ast) Bsxys1 (asr)Bsxys2 (ast) Baxyss 0s 


$1 $2 $3 0 
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Q; and Q, give us similar systems Q(é, ¢) and Q(y, &) in planes EF, and £, , 
respectively. 


4. Canonical form for F when X(xr) = 0, Y(y) = 0, Z(z) = 0 are elliptic. 
From [12] we sum up the necessary preliminary theorems: 

If one of X(x) = 0, Y(y) = 0, Z(z) = 0 is a non-evanescent elliptic cubic, the 
others must be the same. Since in this case the cubics are birationally equiva- 
lent (under quadratic transformations), they must be projectively equivalent. 
We may therefore, by a suitable non-singular linear transformation on z, insure 
that Z(x) be a constant multiple of X (x), and think of £, as superimposed on E, . 
The net of quadratic transformations (given in the present paper by Q,.) will 
then transform X(x) = 0 into itself. For the general elliptic cubic the only 
such transformations are (i) the involutions interchanging the members of every 
pair of points collinear with some fixed point of the cubic; (ii) the products of 
two such involutions. 

We now turn to the algebraic treatment. As in [12] we use the theorem 
that the classification of the forms F is abstractly identical with the classifica- 
tion of the matrices M,(z) = (> ayi;a,) under (i) multiplication on left and 

h 


right by non-singular constant matrices and (ii) non-singular linear transforma- 
tions on x. In the course of the work it will be shown that for the classes of 
forms obtained, no equivalences are introduced by interchange of the sets of 
variables. Any of our matrices M,(x) can be obtained by setting up a net of 
quadratic transformations carrying X(x) = | M,(x) | = 0 into itself. First we 
consider the transformation effecting on X(x) = 0 the involution Jg for which 
a flex point Q is the center. The net of quadratic transformations is equivalent 
to a collineation; that is, the form Q(n, £) of §3 factors into Q:(7)-Qe(f), where 
Qi(n) is linear in n, Qe(f) is linear in ¢, and both are linear in x. Hence if 
Q(n, ¢) is taken as a bilinear form in , ¢, its determinant is of rank no greater 
than 1, whatever be the value of x. Since this determinant is to within a sign 
the adjoint determinant of | M,(x) |, we must have X(x) = 0. This case does 
not fall within the range of the present paper; indeed, its canonical form was 
given in [12]. 

We next consider transformations producing the involution 7, whose center 
is the generic point a@ upon the cubic. Since a is not a flex point, the contact 
points of the four tangents from a to the cubic are independent, and we may 
take them to be (1, 1, 1), (1, 1, —1), (, —1, 1), (—1, 1, 1). Since the reference 
triangle is now also upon the cubic, the curve’s equation becomes 


2 2 2 2 2 2 
(1) QX\(Le — 23) + AeXe(X3 — X}) + QgX3(2X _ 2) = 0, 


where (a; , @2, @3) is the point a. Under the involution the four contact points 


2) 


are fixed; also 


x = (a, 0, as) <> z = (0, 1, 0), x = (a, a2, 0)<>2z = (0, 0, 1). 
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Setting up these correspondences, we get three independent bilinear forms in 
z and 2z: 
212) — r3z3 = 0, 
(2) Tz — rxz3 = 0, 
(GoX%3 — A3%2)Z1 + (Q3%1 — 1X3)Z2 + (AX. — AexX\)23 = 0. 
We have therefore a net of quadratic transformations Q,.(a), each of which 
effects the involution J, on the cubic. Eliminating the z’s, we get the corre- 
sponding M,(z). Finally, we consider a transformation 7'p of the cubic which 
is the product of two involutions. If 7p is produced by a quadratic trans- 
formation at all, it is produced by the product of a transformation of type 
Q..(a) and an automorphism of the cubic corresponding to a flex involution J. 
Thus the matrix determined by any 7'p is equivalent to the matrix determined 
by some J, . 
We have now proved that every class M,(x) can be represented as 
Zi 0 —Zs 
(3) 0 Z2 —TZ3 
QeX3 — AzgXe AzX; — AyX3 AyLe — AX) 

We want next to discover what restrictions must be imposed on the a’s, and 
under what conditions two different sets of a’s give matrices in the same class. 
For this purpose we study the geometric configuration presented by the net of 
cubics {a;, a2, a3} = {a} given by (1) as the point a varies. These comprise 
the totality of cubics on the points 


(1, 1,1), (1, 1,—1), (, -1, 1), (—1, 1, 1), (0, 0, 1), (0, 1, 0), (1, 0, 0). 


The point a is the point for which the cubic {a} is the isologue in the trans- 
formation zz; = 1. The four tangents from a to {a} are 


(dz — a3)%: + (G3 — a)a2+ (a; — e)x3 = 0, 

(a2 + d3)%1 + (—a3 — ))r2 + (—a) + Ae)x3 = 0, 

(de + d3)t1 + (3 — G))X2 + (—Q) — Ae)a3 = 0, 
(—d2 + d3)ti + (3 + G1)t2 + (—G) — Ge)x3 = O, 
for which the four contact points are respectively (1, 1, 1), (1, 1, —1), (1, —1, 1), 
(—1,1,1). One value for the cross-ratio of the tangents is (a? — aj) /(a3 ai); 
the other five may be obtained by permutations of a, , a2, a3. The polar conic 


(4) 


of a with respect to {a} is 

(5) ai(xz — 23) + a3(z3 — xi) + a3(zi — 22) = 0. 
As a varies over the plane, the conic (5) ranges in the pencil 
(6) axe — 23) + u(zs — zi) = 0. 


The conic’s parameter \/u in (6) is precisely the cross-ratio invariant of the 
corresponding cubic. The three degenerate members of the pencil, 


2 2 2 2 2 2 
Ie — 23; = 0, x3 — 2 = 0, x; — r2 = 0, 








3 in 
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give the locus of points a for which {a} has at least one double point. In fact 
for these points the cubic degenerates. Hence for our matrix M,(x) we must 


avoid the values a for which 
-” 2 2 2 2 2 2 
(7) (ag — a3)(a3 — @;)(a; — ae) = O. 


As to equivalence, we note that the M,(x)’s determined respectively by the 
points a and b are equivalent if and only if there exists a collineation that sends 
‘a} into {b} in such a way that J, coincides with J,. For this to occur the set 
of points (1, 1, 1), (1, 1, —1), (1, —1, 1), (—1, 1, 1) can at most be permuted. 
There are therefore 24 collineations sending {a} into cubics with equivalent 
matrices, and every point a is in a set of 24 equivalent points obtained by 
permutations and changes of sign of the coérdinates of a. Because of the 
method of construction of M,(x) and the involutory character of the corre- 
sponding transformations, interchanges of the sets of variables x, y, z introduce 
no g-equivalences. To sum up we state 

THEOREM 2. Every ternary trilinear form F = (ax)(By)(yz) for which X(x) = 0 
is a general elliptic cubic can be represented in the canonical form 

F = xyyrtr + reyote — Layi2s — Tayo%s + Aetsysts 
(8) 
— Asl2ys21 + AsliYs22 — ATsYs%o + ALeYs23 — AsTiYs?s . 
We must avoid the values a given by (7). The cubics X(x) = 0 with the same 
absolute invariant are the isologues of points on the 6 conics in (6) whose param- 
eters are the 6 possible values of the cross-ratio invariant of the cubic. If we select 
one of these conics and consider on it only one point in each set 
(a; , dz, as), (a; , @2, —4s), (a; , —@2, as), (—a; , a2, as), 

we shall obtain one and only one representative for each class of forms corresponding 
to the given projective class of cubics. 


5. Certain concomitants of / calculated for the canonical form; two absolute 


projective constants. For F given above (§4, (8)) we have immediately 
X(x) = 3Ba = qax,(x — 23) + aore(x3 — xt) + asxs(zi — 23), 
Y(y) = {Bs = (ai — ad)yys + (ad — as)yoys — yrye(yr + 2), 


r, 2 2 2 2 2 2 
Z(z) = 6B, = ayz(23 — 22) + Gaze(Z1 — 23) + Gs2s(z2 — 21), 


x 0 —X m 
) a= 
Q(n, $) = -3Qe = " nee: 
‘ QeX3 — AsXe G3%¥,; — AyX3 AyTe — AX, Ns 
S| $e ts 0 
Y1 AsY3 — AgYs £1 
. — A3Y3 Y2 ays £2 
QE, $) = —3Q3 = ag 
A2Y3 —QyYs i" ie $3 


e 


is) v2 ts 0 
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21 0 —%% m 
0 22 —2Z3 Ne 
= —4i = - 
Qn, 8) 1Q1 = | ata — ety its — Osh sey — Ht Ns | 
fi £2 §s 0 | 


It is of interest to calculate the quartinvariant S and the sextinvariant 7 for 
the cubic X(z) = 0. We use Cayley’s table ((1], p. 325), but to get our S and T 
multiply those given by Cayley by 81 and —+3®, respectively. 

S= aj + az + a3 — aja; — aga; — ajai, 

T = (2a3 — ai — a;)(2a; — a — aj)(2a; — az — aj). 
We may arrive immediately at these expressions by noting that if S = 0, the 
cubic is equianharmonic with cross-ratio (a3 — a})/(a; — a3) = o, where o - 
o + 1 = 0, while if 7 = 0, the cross-ratio (a} — a})/(a} — a3) = —1, 2, or }. 
The discriminant is 

R = 48° — T° = 27[(ai — a3)(az — a})(aj — aj)’, 

checking the range of excluded values given by (7), §4. The cases where R = 0 
are considered explicitly in [12]. Considered as invariants of the ternary tri- 


linear form, these invariants have the following degrees and weights, the weights 
being calculated for each set of variables: 


S: degree 12, weight 4; 
T: degree 18, weight 6; 
R: degree 36, weight 12. 
Written symbolically for F = (ax)(Sy)(yz), 
S = Ky (excrqers) (argerseag) (crzexgex9) (2100111012) (818487) (8285810) (Bs8s811) (BoB 12) 
- (yrverz) (veverw) (ysvev1) (ever), 
T) = Ko(cxcrgcrs) (cxgcryexs) (cxzexgexg) (041000310412) (cry300140015) (cxye0r170%18) 
+ (818487) (B28sB10) (8 s8s811) (BeB1a816) (89814817) (812868) 
 (vvvers) (veer) (ysverus) (yevisyis) (ysvisviz) (YizvisYi); 


where K, , Ke are constants. For the canonical form the invariants S, T, R 
are precisely the same for all the cubics. For the form F = (ax)(By)(yz) the 
condition Rk = 0 gives the cases of singular cubics studied in [12]. 

The invariant of lowest degree for F = (ax)(By)(yz) is (arcrees) (cxgersers) (8183835) 
- (BsBoBo) (vrveve) (veve7s), of degree 6 and weight 2 for each set of variables. It may 
be obtained as the apolarity invariant of any one of D, , Ds, D, with itself. 
For the canonical form F, it is to within a constant factor equal to 


2 2 2 
Ip = a; + ap + G3. 
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The invariant D mentioned in §1 is given by Pasch for the form >> aacjanyiz; 
as a determinant of order 9. Of degree 9, it is of weight 3 for each set of vari- 
ables. For the canonical form F, it is to within a constant factor equal to 


Is = a020; , 


which will figure in §6. 

If we add the invariant J, = aja} + a3a; + aja; = 3(13 — S), of degree 12 and 
weight 4 for each set of variables, we may set up two absolute projective con- 
stants for the ternary trilinear form: I,/I? and I3/I2. When definite values 
are assigned to these absolute invariants, and the resulting expressions cleared of 
fractions and treated as equations of the fourth and sixth degrees respectively 
in the homogeneous variables a; , dz , a3 , we get 24 independent solutions. Since 
the equations are symmetrical in the a’s and contain their even powers only, 
these 24 comprise a conjugate set of a’s which produce equivalent forms. Hence 
we have 


THEOREM 3. Two ternary trilinear forms for which X(x) = 0 is a general 
elliptic cubic are equivalent if and only if they have the same absolute invariants 
L/Iz, I/It. 


Since the determination of a class of forms depends upon the choice of the 
absolute invariant of the cubic X(xz) = 0, as well as upon the selection of the 
point a (and therefore the coresidual g3’s) upon the cubic, it would be advanta- 
geous to choose for one of the 2 constants the absolute invariant J = S*/T7”. 
If, however, J and J3/J3 are taken for the constants, we get for definite values of 
these constants, 72 sets of a’s which break up into 3 equal groups; the forms 
produced by a’s in the same group are equivalent but not those produced by 
a’s in different groups. Likewise the constants J and J,/J; determine the forms 
only to within 2 non-equivalent classes. Nevertheless we shall often use J in 
the next section. 

By the simultaneous transformations 


; , / 
q; = qe, "= Y2,; 21 > 22, 
/ , aa ‘i 
(1) m= 2X, y= Ny, z= %1, 
, , , 
t3 = 73, ¥s = —Ps, 23 = 23, 


F may be sent into a form differing from F only in the permutation of a , as. 
Since the weight of any (integral) invariant must be the same positive integer 
for the three sets of variables, any such invariant is multiplied by unity when the 
transformations (1) are carried out on F. Hence all (integral) invariants of F 
are symmetric in a , dz and may be similarly proved symmetric in a , a2, a3. 
They are therefore integral rational functions of the elementary symmetric 
functions 


8 = a + A, + 43, 8g = QjQq + A203 + asd), 8 = 1;. 
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Of these, s; is an algebraic invariant (cf. [10], pp. 10, 11) of degree 3 and weight 1 
for each set of variables; s2 is an algebraic invariant of degree 6 and weight 2. 
For F they are determined respectively as roots of the following equations: 


st — 4]osf + (613 — 81,)st + (1622, — 472 — 6473)si + 12 — 8121, + 1675 = 0, 
(2) 8. — 21483 — SIjse + Ii — 40312 = 0. 


Also 
1 
a = 3 (82 = Is). 


The 8 conjugate values of s, and the 4 conjugate values of se given by equations 
(2) are obtained for the canonical form F by changes of sign of a; , a2 , as . 

If we take s2/s;, 8;/s; as the absolute constants of the form F, we get for 
definite values of these constants, 6 sets of a’s which all give rise to equivalent 
forms. However, any definite pair of values for 82/8; , 83/8: is associated with 
7 other pairs which give rise to the same class of forms. 

We have almost immediately the theorem (verified by Maennchen’s calcula- 
tions and transformations) that D or J; is the sole integral invariant of degree 9. 
For any such invariant, being of weight 3, must be a linear combination of 
8, 8182 and s; = 7;. However, the first two forms are not integral rational 
invariants since any integral invariant containing s,; as a factor must contain 
its conjugates and be of weight 8 at least. 

6. Periodic transformations among the cubics X(x) = 0, Y(y) = 0, Z(z) = 0. 

(a) General discussion. A consideration of Q.. , Qs, (§3, (2b), (2c)) leads to 


THEOREM 4. We assign the elliptic parameter u on X(x) = 0, v on Y(y) = 0, 
and won Z(z) = 0, so that u is the canonical parameter on X(x) = 0, while x, 
Yo , 20 have equal parameters u = v = w Uf yo , % correspond to x under ,T,, 2T:, 
respectively. Then on Y(y) = 0, Z(z) = 0, the collinear conditions are respectively 

v1; + v2 + v3 = k, Ww, + We + w; = —k. 

If we consider the systems of conics Q(é, ¢) and Q(», &) with their correspond- 
ing sets of transformations, we have for the g2’s corresponding to the line sections 
in the various planes 

X(z) = 0, Y(y) = 0, 
lines —& 2 uy + Ue + uz = 0, qo (é) 20 + ve + v3 = O, 


gin) im +uw+wu=_ k, lines 7 : v; + ve + v3 = k, 
g2(¢) > + U2 + us = —k, go (¢) > 0 + v2 + v3 = 2k, 
Z(z) = 0, 

g: (tf) :wtutw;= 0, 


g2°(n) : wi + we + ws = —2k, 
lines € : wy + We + wy; —k, 





ht 1 
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where, e.g., in the first line the points of the triads g:°() on Y(y) = 0, and 
go (é) on Z(z) = 0 correspond to the points of the line — on X(x) = O under 
Ty, 27’. , respectively. Hence if yo, z correspond under ,7,, the relation 
between their parameters is 


w=v—k. 
There follows immediately 
THEOREM 5. Let a sequence 
(1) LoYoZTiY1 -- + ZXYi2iz1 --- 


be given, where x; corresponds to z; under .T, , yi corresponds to x; under ,T, , and 
2ix, corresponds to y; under ,T'.. Then the necessary and sufficient condition that 
the sequence be periodic is that k be a submultiple of an elliptic period. The same 


conclusion holds for the analogous sequence 
ToeoYiX121 +++ YitYinr ---- 

For (1) the corresponding sequence of transformations on the elliptic param- 
eters is 
(la) Vo = UW, Wy = Wo — k, uy = | — k, 4 = | —k, We = Wy — Jk, 

We shall assume that 
(2) nk = 0 mod (@ , ws), 
where w; , w: are the fundamental elliptic periods, and shall consider the trans- 
formation on X(x) = 0 produced by .7,-,7.-:7,. Obviously similar ones take 

° owe ° » . ° 3 Bion: 

place on the other cubics. Then in EZ, we have for triads g2() and g2(f), re- 
spectively, 


r 2 X 9 
(3) Uy + ue + us = w+ =, Us + Us + Us —— + = 


n n 


Ii 


° 3 ° | ee ° ° ° . 
and the series g2(n) (or the series g2(¢)) is cut out by an n-ie with n-fold contact 


3 
at the 3 points given by wm , Ue, Us (Or uy, Us, Ue). Lin > u; = 0, the conies on 


i=l 


3 
° 3/5 ° . 
Uy, Ue, Us Cut Out a series go(f) such that for each group uy, Us, Ue In it, > Uj 


=! 


6 6 
+ pm u; = 0; whence n > u; = 0. We thus have 


i=4 i=4 
TuroreM 6. The existence on the cubic curve of one group UW , Us, Us such that 
3 

n Du 
i=l 


produce periodic transformations. 


. ‘ , : 3 3/> ° 
insures the existence of two coresidual series go(n) and go(¢) which 


) 


’ 


An n-ie with n-fold contact at 3 points is subject to 3n conditions. However, 
only 3n — 1 points can be chosen on the cubie to determine the n-ic. Hence 
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there is always one condition on the cubic and the set of points uw; , ue , us that the 
set be in a g2(m) or g2(¢) of the desired type. From analysis it is obvious that this 
condition can be put upon the set of points. The two coresidual g3(n) and 
g2(¢) are identical only when n = 1 or 2. In all other cases the 9 collineations 
of the cubic into itself given by the flex involutions: 3(u’ + u) = 0 interchange 
the g2(n) with the g3(¢), while the 9 collineations 3(u’ — u) = 0 send each series 
into itself. 

(b) Case n = 1. Here the transformation carried out on X(x) = 0 by 
27',-yTs-21'z is the identity. Since the series g3(m), g2(¢) are cut out on X(z) = 0 
by all the straight lines in E, , the system of conics Q(n, £) factors as in §4, as do 
Q(é, ¢) and Q(n, &). Hence X(x) = Y(y) = Z(z) = 0. The identical vanishing 
of any 2 (and therefore of all 3) of B. , Bs, B, constitutes the necessary and 
sufficient conditions for this case. We cannot set up the conditions on the a’s 
as for the other cases, since this situation cannot occur for our canonical form. 

(c) Casen = 2. Here the g2(n) = g2(¢) and k is one of 40; , we, $(w1 + ws); 
whence we have 

THEOREM 7. Given the invariant J, there exist 3 non-equivalent classes of ternary 
trilinear forms for which the transformation ,T,-,T,-:T,0n X(x) = 0 is of period 2. 
The g3’s corresponding to these classes are cut out on X(x) = 0 by the contact conics 
constituting the 3 systems of poloconics of the lines of the plane with respect to the 
3 cubics for which X(x) = 0 is the Hessian. The conics of the system Q(n, £) are 
the mixed poloconics for the line pairs of the plane. (For definitions and references 
for these systems of conics cf. [2], pp. 471-472; [6], p. 379.) 


For our canonical form let the elliptic parameter of the point abe a. Then the 
parameters of (1, 1, 1), (1, 1, —1), (1, —1, 1), (-—1, 1, 1) are (not necessarily in 
this order) congruent to — $a, $(—a +1), $(—a@ + we), $(—a +1 +2). The 
diagonal triangle of this four-point is the reference triangle; its vertices have 
parameters congruent to a + 401, a + $we, a + $1 + w). Since these 
vertices constitute a triad g2(n), we must have 6a = 0. There are 36 such points, 
of which the 9 flexes must be discarded, leaving for possible a’s the 27 sextactic 
points which are the intersections with the cubic of the 9 harmonic polars. 
From the parameters it is evident that a must lie on one side of the reference 
triangle. Hence 


TueoreM 8. In generalif R # 0, the necessary and sufficient condition that the 
transformation ,7,-,7'.-2,T,0n X(x) = 0 be of period 2 is that I; = 0. 


When a is chosen, two vertices of the reference triangle prove to be the two 
other sextactic points upon the harmonic polar through a, while the third is the 
corresponding flex point. Thus the particular g2 depends upon the choice of 
2 out of 3 sextactic points to serve with their flex as a group of the g3 . 

(d) Case n = 3. If we take a to be the parameter of the point a as in case 
(c), we find that here 9a = 0, or a is a coincidence point (as defined in [2], p. 501). 
Excluding the flexes, there are 72 such points; taking account of the 18 collinea- 
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tions of the general cubic into itself, we should expect 72 = 4 non-equivalent 
a’s;i.e., 4 non-equivalent pairs of coresidual g3’s on the cubic. Furthermore the 
72 coincidence points are known to be the vertices of the 24 Hart triangles in- 
and-escribed to the cubic ([2], p. 501). If the vertices of such a triangle have 
parameters U,, U2, Us, We have 3(u, + we + Us) = 0; hence these vertices 
constitute a group of g2(n) or of g2(¢) of the type described. It is known that the 
Hart triangles can be divided into 4 sets of 6 each such that the triangles in a set 
are permuted among themselves by the 18 collineations. The vertices of the 6 
triangles in a set prove to be 3 groups of one g2() and 3 groups of its coresidual 
g:(¢); the 8 ternary cyclic collineations permute the triangles of each series among 
themselves; the 9 flex involutions interchange the triangles of g2(n) with those of 
g:(¢). If a has parameter a, the sum of the parameters of the vertices of the 
Hart triangle of which a is a vertex is congruent to 3a; this is precisely the 
congruence sum obtained for the group g2(») of the reference triangle. 


THEOREM 9. (Given the invariant J, there exist in general 4 non-equivalent classes 
of ternary trilinear forms for which the transformation ,T,-,T,-:T, on X(x) = 0 is 
of period 3. The 4 pairs of coresidual g3’s correspond to the 4 sets of Hart triangles; 
each pair of g2’s contains as groups the triads of vertices of one such set of 6. 


Three non-collinear flex points also constitute a group of such a g3(n) or 
g2(¢), as do one flex point and a second flex point taken twice. Of the flex-group 
for which we have one repeated, we have 2Cy2 = 72; of the flex groups of 3 
different points we have Cy; — 12 = 72, since they must be non-collinear. The 
argument goes forward as before, giving {7 = 4 non-equivalent pairs of core- 
sidual g2(n), 93(¢). 

To determine the invariant for n = 3, we use the fact that the point a is its own 
third tangential. The first tangential is 


, , , ‘. 
(4) X, = 2A, Xe = A3Q,, X3 = A,Q2; 
the second is 
” 22 22 22 ” 22 22 22 
X, = G2Q3(a2a3 — a3a; — aja2), Xe = 30;(a3@; — ajdz — Aea3), 
(5) — 22 2.2 23 
3 = Q02(a\a¢2 — Q@eQ3 — @3@}). 


Writing out the equation for the tangent at x’ and imposing the condition that 
the codrdinates of a satisfy this equation, we get 


TueoreM 10. IJngeneral, if R # 0, I; # 0, the necessary and sufficient condition 
that the transformation ,7',-,T,-2T', on X(x) = 0 be of period 3 is that 


(6) ai(a; — a3)° + a3(as — aj)* + as(a; — az) = 0. 


As a check consider the number of independent solutions of (6) solved simul- 
taneously with the conic in the a’s for given cross-ratio \/y: 


(7) A(a} — a3) + u(a — at) = 0. 
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We have 2-14 = 28 solutions, of which 4-3 = 12 must be excluded, since 


(1, 1, 1), (1, 1, —1), (1, —1, 1), (—1, 1, 1) are triple points of (6) and simple 
points of (7). Remembering that the 4 sets of values (a, , +a2, +43) produce 
forms in the same class, we get +;* = 4 non-equivalent classes. 


1. 
2. 
3 
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THE PROBLEM OF TYPE FOR A CERTAIN CLASS OF RIEMANN 
SURFACES 


By F. E. Uric 


1. Introduction. In the present paper there are obtained sufficient conditions 
that a certain class of open simply-connected Riemann surfaces shall be of 
parabolic type,’ that is, can be mapped one-to-one and in general conformally 
on the plane with one point removed. The surfaces to be considered are, briefly, 
surfaces which have a single transcendental singularity which is a limit point of 
algebraic branch points of first order, and a logarithmic branch point. 

The results are derived from a criterion due to Ahlfors.” It can be stated as 
follows. Let the open simply-connected surface W be spread out over the 
w-plane and a metric on W be defined by a differential form 


do = X(u, v) | dw\, w=u+ w, 


where A is a real, single-valued function, continuous on W with the exception of 
certain isolated points. Moreover, let the metric be so chosen that no singu- 
larity of the surface is accessible along a path of finite length. Let W, be the 
region of the surface consisting of those points whose distance from a certain 


initial point Po , in the metric considered, does not exceed a positive number p. 
Let L(p) be the length of the boundary of W, in the metric considered. Then, 
a necessary and sufficient condition that W be of parabolic type is that there exist a 
metric of the type defined above such that the integral 


| * dp 
L(p) 
diverges. 


2. Class of surfaces to be considered. The surfaces W to be considered are 
of the following sort. 

Let {A,} (v = 1, 2, 3, ---) be a countably infinite set of points of the real 
axis of the w-plane, which has as sole limit point the point at infinity. More- 
over, suppose A, > 0 for v odd and A, < 0 for vy even. Over each point of 
|A,} shall lie one and only one algebraic branch point of first order. There 
shall be no other algebraic branch points. There shall be a logarithmic branch 
point over w = 0 along with an infinite number of smooth sheets. There will 


Received February 3, 1939. 
' The writer wishes to express his thanks to Professor Ahlfors for suggesting the problem 
and for the valuable counsel given by him. 
*Comptes Rendus, vol. 201(1935), pp. 30-32. 
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be a transcendental singularity over w = ~, which is a limit point of algebraic 
branch points.’ 

Since all branch points and singularities of W lie over points of the real axis, 
the sheets can be divided into half-sheets by a cut along this line. All branch 
points and singularities will then be on the boundaries of these half-sheets. 

The half-sheets thus formed shall in general be of two sorts. The upper and 
lower half-sheets of the first kind shall be denoted respectively by S; and S, 
(v = 1, 2, 3, ---) and be spoken of as the algebraic half-sheets. For each », 
both S; and S; shall abut on the algebraic branch points over A, and A,,,;, 
as well as the transcendental singularity over w = «, but together form a 
smooth sheet over each of the remaining points of {A,} as well as over w = 0. 

It is convenient to separate the half-sheets of the second kind in two infinite 
sets. The upper and lower half-sheets of the first set shall be denoted respec- 
tively by QF and QF (v = 1, 2, 3, ---), those of the second set respectively by 
Q*, and Q-, (v = 1, 2,3, ---). Each half-sheet of both sets shall abut on the 
singularities over w = Oandw = ©. But an upper and a lower half-sheet with 
the same index together shall form a smooth sheet over each point of the set {A,}. 

There shall be one remaining sheet, the upper and lower half-sheets of which 
shall be denoted respectively by Qj and Q> . Both Qj and Qo shall abut on the 
singularities over w = 0 and w = © as well as the algebraic branch point over A;, 
but together shall form a smooth sheet over each of the remaining points of {A,}. 

The half-sheets QF (v = 0, +1, +2, ---) shall be spoken of as the logarithmic 
half-sheets. 

The half-sheets are joined in the following way: 

The two sets QF (v = +1, +2, +3, ---) form two logarithmic ends, one 
formed by the set QF (v = 1, 2, 3, ---) joined to Qj along the segment (0, — ~), 
the other by the set Q= (v = —1, —2, —3, ---) joined to Qo along (0, — ~). 
More completely stated, to Qj_, is joined Q; along the segment (0, — ~) and to 
Q, is joined Q; along the segment (0, + ~) (v = 1, 2, 3, ---). This joining 
forms one of the logarithmic ends. Then to Q>4,; is joined Q; along (0, — ~) 
and to Q; is joined Q) along (0, + ~) (v» = —1, —2, —3, ---). This joining 
forms the second logarithmic end. 

To complete the surface W, Qo and Qo are joined along the segment (0, Ai) 
and the algebraic half-sheets are adjoined as follows: To Qo is joined Sj along 
the segment (A, , + ~), to Qo is joined S; along (A, , + ~) and to Sj is joined 
Sy; along (A; , Az). Now let (A,, ©) (v = 1, 2,3, ---) be that infinite segment 
which is on the real axis of the w-plane, which has finite end point A, , and which 
does not contain the point A,,,. Then for »v = 2, to S, is joined S; along 


* It should be pointed out at this time that the requirement that the points A, have 
the point at infinity as a limit point is entirely unnecessary for the work to follow. But 
it will be seen (§11) that the sufficient conditions obtained are not met unless | A, | be- 
comes infinite with increasing vr. If the | A,| are bounded, the character of the singu- 
larity of W over w = @ is changed. Instead of a transcendental singularity of the sort 
described above, there will be two logarithmic branch points. 
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(A,, Ava), Spa along (A,, ©) and Sy; along (A,41, ©). To S> is joined S; 
along (A, , Ay41), S,-1 along (A,, ©) and S,4,; along (A,4;, ©). 

The algebraic half-sheets S; and S, together shall form the algebraic sheet 
S, (v = 1, 2, 3, ---), and the logarithmic half-sheets Q; and Q, together shall 
form the logarithmic sheet Q, (v = 0, +1, +2, ---). 


3. The metricon W. A metric on the surface W will be defined by the differ- 
ential form 


do = X(u, v) | dw], w=utw, 


« being are length in the non-Euclidean metric. The function A(u, v) = A(w) 
shall be a real, single-valued function, continuous on W except for isolated points. 
\(w) is defined as follows: Let & be a place on W over the point w of the w-plane. 
Let A(w) be the Euclidean distance on the surface from w to the essential singu- 
larity over w = 0. Here distance is used in the sense of shortest distance, or 
more precisely, the greatest lower bound of all Euclidean distances on W from 
to the singular point over w = 0. We then define 


1 
Xr = ee 
(w) A(w)’ 
and have 
_ |dw| 


do = A(w) 


In the logarithmic sheets Q, , since they abut on the singularity over w = 0, 


A(w) = |w|. 


Since the algebraic sheets S, do not abut on the singularity over w = 0, in 
determining A(w) for w in one of these sheets, it is necessary to follow a path on 
the surface from @# into one of the sheets which do abut on this singular point. 
From the way in which the sheets of W are connected, it is clear that a shortest 
path is determined by going from @# in the sheet S, in question, through the 
sheets S,- with index v’ < », into the sheet Qp and hence to the singularity. We 
then have for # in S,, 


A(w) = |w — A,| +|A, — Aral + +++ +] Ae — Ar| + A. 
If we let 
(1) |A, — Avia] = Pri, 
and 


(2) G=Pptmat-::: +p, 21, 


the analytic expression for the differential form defining the metric on W has the 
following form in the various sheets. 
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In all the logarithmic sheets Q, , 
(3’) de = | dw 
| w | 
In the algebraic sheet S, , 
| dw | 


6") O° CC Eita' 


where ¢, is defined by (1) and (2). 


4. Geodesic lines in the logarithmic ends. We shall be interested here only] 


in that family of geodesic lines which lie in the logarithmic ends and pass through 
A, , the branch point over A,. Let P)” be the point of Q, lying over Po of the 
w-plane. Let the polar codrdinates of Po be (ro , 0%) (—2 < % S 7), where the 
pole is taken at w = 0. To determine the geodesic line in the logarithmic ends 
through A, and P;” , that curve C on W must be found which lies in the loga- 
rithmic ends, joins A; and Pj” , and for which the integral 


/ | dw | 

Cc | w | 

extended from A, to P%” is least. To this end consider the transformation of the 
logarithmic sheets by 


w = a+ iB = log w. 


This maps the logarithmic ends on the w-plane cut along the segment (log A; 
+ «). This cut is the image of the junction lines of the logarithmic ends with 
the algebraic sheets. The sheet Q, is mapped on the strip (2y — 1)r < 6 ¢§ 
(Qv + 1)x (v = O, +1, +2, ---). 

Let C be any path in the logarithmic ends joining A, and Py”, and let y be the 
image of C in the w-plane. y is a curve joining the points with Cartesian co 
ordinates (log A; , 0), (log ro , % + 2mv). Since C stays in the logarithmic ends, 
y does not cross the segment (log A,, + ~). The Euclidean length of y é 
given by the integral of | dw | alongy. But 


[ \ae| = [ a 


extended from A, to Pj”. Consequently the value of this latter integral will be 
least if and only if C is the image of the straight line of the w-plane joining the 
points (log A; , 0), (log ro, 0% + 2xv). This line is given parametrically, with 
parameter t, by 


To 


w = w(t) = (tog A; + tlog A ) + it(0) + 2rv). 
Al 
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The image E, in the w-plane is given parametrically by 


log r = log A: + ¢ log a 6 = t(0 + 2nv). 
Ali 


If v # 0, 0  O, this image will be the logarithmic spiral 


r=A exp | ° lo 2] 
Plat anv As!’ 


which becomes the circle r = A, if 79 = A;. In the special case vy = 0, = 0, 
the image will be the line 6 = 0. 

From the above discussion we have the following result :* 

The family of geodesic lines through the branch point over A; and lying in the 
logarithmic ends is the family of curves E, on W lying over the spirals E, . 


5. Geodesic lines in an algebraic sheet. We shall be interested here in those 
geodesic lines which lie in the algebraic sheet S, and join two points of S, lying 
om the same half-ray emanating from A,, the branch point over A,. If w — 
4, = re”, then 


‘) / | dw | ae / (dr? + r°d6’)' 
lw—A,|+c r+q — 


If (r: , 00), (v2 , %) are two points of S, on the same half-ray emanating from 4A, , 
and C is any path in S, joining these points 


(dr? + r°de*)' > / | dr | > / dr 
c r+, ~ dertao |Jer+e, 


But the integral on the right is independent of the path and has the value 
log (r2 + c,)/(71 + ¢,) |, which is the value of (4) taken along 6 = 4 from (r; , 4) 
to (r2 , 0). Hence we have the following: 

The geodesic line in S, which joins two points of S, lying on the same half-ray 
emanating from the branch point over A, is the straight line joining these points. 

The above conclusions are true if in particular r; = 0. We then have: 

The family of geodesic lines in S, through the branch point over A, is the family of 
traight lines through that point.’ 


‘If w = re*?, / | dw|/|w becomes | r-1(dr? + r2d@*)!. The extremals of this integral 


ure found to be the spirals of the family r = be**. The spiral of this family determined 
ty the sets of codérdinates (A: , 0), (ro, 60 + 2xv) is precisely the spiral EZ, found above. 

*The extremals of integral (4) through r = 0 are found to be the lines of the family 
'= constant. 











572 F. E. ULRICH 


6. Distances in the various sheets. The distance from A,, the branch point 
over A, , to Pj”, a point of the logarithmic sheet Q, , is given by 


2 2 32\4 
D(A, Pi”) = / (dr > rd") 
r 


where the integral is extended from A, to Pj” along the curve E,. If Py’ 
lies over the point roe”? (—2 < % S< x), along E, 


r = A, exp * log 4 
y 8 A, }’ 


where 0 = 6 + 2xv, provided 6 ~ 0. Evaluation of the integral expressing 
D(A, , P%”’) along this curve yields 
- (y) Wo 
(5) D(A1, Pi”) = | log = | 
A, 
where wp = re”. In the special case 0 = 0, EZ, lies over the line 6 = 0, and itis 
immediate that (5) is still correct. 
The distance from A, , the branch point over A, , to a point P, of the algebraic 
sheet S, is given by 
a 1 2 r 2\4 
pu.) « | Se" 
r+, 
where the integral is extended along the straight line joining A, and P, , and the 
pole of the system is at A,. If (ro , 6) are the coérdinates of P, , 


(6) D(A,, P,) = [ a log (1 + ”). 
o Tr + Cy Cy 


The junction lines between the logarithmic sheet Q) and the algebraic sheet S;, 
as well as the junction lines between successive algebraic sheets, are the loci of 
points which separate two regions in which there are different analytic expres- 
sions for the distance from the branch point from which the junction line ema- 
nates. For instance, if Py), a point of either of the junction lines between Q 
and S; , is considered as lying on a shore of Qp , the distance D from A, is given 
by (5), while if considered as lying on a shore of S, , that distance is given by (6) 
withy = 1. Inthe first case 


where po is the radius vector of Py with the pole over w = 0. If we use (6) to 
determine the distance D’ in the second case, ro is the radius vector of Po with 
the pole at A, : 

"a= pr” A;, 
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and we have 


D' = loe (1 + a= 4). 


C1 


But by the notation of (1) and (2), c: = po = Ai, and we again have 


D’ = log © = D. 
Ai 
If Po , a point of either of the junction lines between S, and S,_, , is considered 
as lying on the shore of S, , the distance D from A, to Pp is given by (6), where 
the radius vector of Po is measured from the pole at A, : 


D= log (1 +"), 


If Po is considered as lying on the shore of S,_; , the distance D’ from A, to P» is 
measured in the metric of S,,. But in this case A, and P» lie on the same half- 
ray emanating from the pole at A,.,. Therefore by the first of the two results 
stated at the end of the preceding section 


ro 2 2 yn2\4 
D’ = (dr° + rd6@) 
‘1 r+c 


extended along 6 = constant, where r; = | A, — Ay | = pp. and re = m+ py. 
Hence, on recalling the notation given in (1) and (2), we see that 


“t= To + Pra + G—-1 = _ a 
D’ = log ey ye log (1 + : D. 


Similar considerations show that the distance from A,,, to a point of either of 
the junction lines between S, and S,,; is the same whether measured in the metric 
of S, or of S,4:. 

In other words, the value of the distance from an algebraic branch point to a point 
of either of the junction lines emanating from that branch point is the same when 
measured in the metric of either of the sheets joined along that junction line. 


7. Geodesic lines on the surface. The problem of determining the shortest 
path on the surface from A, , the branch point over A; , to any point P of the 
surface now arises. Let G be the path on W defined as follows: If P is in the 
logarithmic sheet Q, , Gis the spiral Z, joining A, and P. If P is in the algebraic 
sheet S, , G is the geodesic line in S, through A, and P. If P is in the algebraic 
sheet S,, (vo 2 2), Gis the path obtained by going from A, along the geodesic 
line in S; to the branch point over Az , from this point along the geodesic line in 
S: to the branch point over A;, and so on, coming finally to the branch point 
over A Then go from this point along the geodesic line in S,, to P. 


vo ° 


Now let C be any curve on the surface joining A; and P, along which the left 
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member of (7) (see below) exists. Let the portion of C in any sheet lying over a 
finite region of the w-plane be rectifiable. It will be shown that 
- / |\dw| . / | dw 
(4) - = = 
c Aw) a Aw) 

Since C, with its end points, forms a closed set of points on W, it must lie in a 
finite number of sheets. In the first place, it is necessary to establish (7) only 
for those curves C such that the portion of C in any sheet lies over a bounded 
region of the w-plane, say over|wj|< R. For, the point at infinity is not 
accessible along any path of finite length in the metrie considered. From (5) 
it is clear that in the logarithmic sheets such distances become infinite like log 

w|. Along a path C in the algebraic sheet S joining (ro , %), (r, 9) 


[ a _ / (dr* + rede’)? > [ | dr | > [ dr 
c c r + Cy a ro T + Cy ii ro r + Cy 


This last integral is independent of the path and with increasing r becomes 
infinite like log r. 

Moreover, it is necessary to consider only curves C such that there are in an 
algebraic sheet S, but a finite number of segments of C which join a point of the 
junction line emanating from the branch point over A, with a point of the june- 
tion line emanating from the branch point over A,4,. For, the integral of de 
along any curve which has an infinite number of such segments is not bounded. 
On one such segment I, 

/ = / | dw = l , 
I ~  R+/A| +e R+|A,| +e 


where / is the Euclidean length of [. But for each such segment we have 
1 =| A, — A,1,, and so if there are an infinite number of such segments, the 





above statement follows at once. 

Suppose C crosses a junction line emanating from the algebraic branch point 
over A, an infinite number of times. All intersections of C with this junction 
line fall in a finite interval along the junction line. Moreover, there will be on 
the junction line a finite number of intervals 7, (kK = 1, ---,m) with the 
following properties: The endpoints of each J; are intersections of C with the 
junction line. In each J, there are an infinite number of intersections of C with 
the junction line, and the portion of C between the endpoints of J, lies 
entirely in the two sheets which are joined along this junction line. Exterior 
to the intervals J; there will be a finite number of intersections of C with the 
junction line. 

Let a, be the first point of 7; reached in traveling along C from A, to P, and 
8, the second. Then, since the segment J, is the geodesic line through a, and 
6, for either of the sheets joined along this junction line, 


Bk 
(8) / do 2 / de, 
atk Tk 
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where the integral on the left is extended along the portion of C between a, 
and Bx . 

Let C’ be the curve on W obtained from C by replacing the portion of C 
between each pair of points a, , 6, on all junction lines by the corresponding 
interval J,. From (8) it follows that 


[ a > | de 


From this it follows that it is necessary to establish the validity of (7) only 
for such curves C which may fall along certain junction lines throughout a finite 
number of finite intervals but otherwise meet junction lines emanating from 
algebraic branch points in but a finite number of points. 

Let C be such a curve joining A, and P. Let {7;} (¢ = 1, --- ,n) be the 
sequence of regions through which C passes, where a region 7’; is either a single 
algebraic sheet, or a sequence of logarithmic sheets such that in passing through 
the sequence along C, no two of the logarithmic sheets are separated by an 
algebraic sheet. Such a sequence of logarithmic sheets constituting a region 7’; , 
i < n, must have Qp as the first and last sheet. It may in particular reduce to 
the single sheet Qo. 7’: must be either the sheet S, or a sequence of logarithmic 
sheets. If P lies in the algebraic sheet S,,, 7, is the sheet S,,. If P lies in 
the logarithmic sheet Q,, , 7, is a sequence of logarithmic sheets, the first of 
which is Q and the last Q,,. In forming the sequence {7';}, if C coincides with 
a junction line emanating from an algebraic branch point throughout an interval, 
that portion of C can be considered as lying in either of the two sheets which 
are joined along that junction line. 

For 1 <i S n, let By. be the point at which C enters 7; ; then B;,i4:, 
i < n, will be the point at which C leaves 7; and enters 7';,,. Let Bo, be A, . 
If B,,,; lies on the junction line emanating from the branch point over A, , 
let G1; be the geodesic line through that branch point and B;.,;. This 
geodesic line will fall along the junction line, if considered in the metric of 
either 7; or T;,. Since Bo, coincides with A, , the integral of do along Go, 
is 0. If 7, is the algebraic sheet S,, let G; be the geodesic line in 7; through 
the branch point over A, and the branch point over A,,,. Let C; be the 
portion of C in T;. 

We shall now consider the integral of do along C;. In this connection it is 
convenient to distinguish several cases. 

Case 1. Those regions 7; for which B;1,; and B;,;,, lie on the same junction 
line. For all such 7;,7%< n. Any 7; consisting of a sequence of logarithmic 
sheets must come under this case. If B;_;,; lies between the branch point and 
By i4;, or By, coincides with By is: , 
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and 


(9) / do > | do =f do; 
Ci C t 


7i,itl Fi-1,i 


for, the geodesic line through B;,,; and B;,;;; in the metric of 7; falls along 
the junction line. 
On the other hand, if 


[ ds > de, 
G Git 


t—1,8 


(9) is still valid, for in that case the right member is negative. Hence for all 
regions 7’; considered under Case I, (9) obtains. 

Case Il. Those regions 7; for which By1,; and B;.i,; lie on different junction 
lines. Here again for each such region 7';,7 < n. Also, each region 7’; of 
this class is an algebraic sheet S,. To treat this case it is necessary to recog- 
nize two cases according as C enters S, over a junction line emanating from the 
branch point over A,, or a junction line emanating from the branch point 
over Avai ° 

In the first case, since the straight line through A, and B;,;:; is the geodesic 
line through these points, 


/ da + | do = / do + de , 
or 
(10) / da > | do + | de -| de. 


In the second case, since G;,i.; is the geodesic line through B;,:;,; and the 


do + [ de | de, 


Girl 


branch point over A, , we have 


/ da +/ 


76~—5,8 


or 


(11) / dao = - | 


There still remains for consideration 
Case III. The region T,. If P is in the logarithmic sheet Q, , let g, be the 
geodesic line #, in the logarithmic ends through A, and P. If P is in the 


algebraic sheet S,, let g, be the geodesic line in S, through P and the braneh 
point over A,. On considering the integral of do along C, , we shall first treat 


7i~1,8 


da +/ dg — / do. 


i 


the case in which 7, is a sequence of logarithmic sheets or the algebraic sheet 
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S, and C enters 7’, over a junction line emanating from the branch point over A, . 


In this case, we have 
[ do + do = / de, 
Cn 


Gn-in On 


or 


(12) [ ds = / de - | de. 


n Gn 7n—1,n 


If 7, is the algebraic sheet S, and C enters 7, over a junction line emanating 
from the branch point over A,,;, since g, is the geodesic line in 7’, through P 
and the branch point over A, , we have 


/ ao + [ do + [ da = | do, 


Gn 


or 


(13) [ de z - | 


n Gn-in 


da - | do +/ de. 
Gn On 


We then have that for each 1 < i S n, the integral / do satisfies one of 
Cy 


the five inequalities (9), (10), (11), (12), (13). 

In the following discussion, if any region 7’; is an algebraic sheet S, and C 
leaves this region over a junction line different from the one over which it 
entered the region, we shall say that C crosses the region 7; . 

In the first place we have that 


(14) [ ds = > [ do + [ de. 


We shall now replace each term of the summation on the right by the right 
member of the one of the three inequalities (9), (10) or (11) which is valid for 


the 7; in question. The result is 
da — / de), 
+1 Gi-ii 


Ef we B («fw +¥(, 


where >.’ denotes the deleted sum in which a term appears only for those 
values of ¢ for which either (10) or (11) is in force. Moreover, the sign of the 
term is positive when (10) is in force and negative when (11) is. 

From the result stated at the end of §6, it follows that 


| do for any index % is equal to [ do for the next greater index t + 1. 


‘iat Gir ti 
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Therefore, since the integral of do along G),; is zero, 


n—l 
=(/ do — [ do) = [ de. 
i=1 Giji+t Gi-1,i G 


If this is used in (15), (14) becomes 


(16) [a > > (+/ ie) +f do+f do 


It is again convenient to distinguish several cases. 

Case 1. P les in a logarithmic sheet. Then 7’, is a sequence of logarithmic 
sheets. In that case (12) is satisfied. Moreover, if P lies in the logarithmic 
sheet Q, , gn is the geodesic line £, , and (16) becomes 


(16:) [ ao = o(+/ ie) +f do. 


It should first be noticed that, for the case under consideration, if C crosses 
any algebraic sheet S,, contributing a term to the deleted sum, it crosses §, 
an even number of times, so that S, appears an even number of times in the 
sequence {7';}. For half of the regions 7';, which are the sheet S, , C enters T; 
from the logarithmic sheet Q) or an algebraic sheet of lower index, while for 
the other half, from an algebraic sheet of higher index. Consequently, for half 
of these regions (10) is in force while for the other half (11) is. Hence, corre- 
sponding to each term of the deleted sum which carries a negative sign, there 
is a term numerically equal with a positive sign, and 


n—l 

Sy (+/ de) = 0 

i=l Gy 
[ a >/ de. . 
Cc Ey, 


But for this case £, is precisely the path G along which the integral on the right 
side of (7) is extended, and hence (7) has been established if P is in a logarithmic 
sheet. 

Case II. P lies in an algebraic sheet S,, and C enters S,, from S,,1.° In 


















(16;) then becomes 








this case [ do again satisfies (12). If we let g,, be the geodesic line in S,, 





through P and the branch point over A,, , g,, is identical with g, , and (16) 
becomes 


n—l 
Tw ty [ a. 
(16) [wz d'(+] a) + 4 o 


° If vo = 1, C enters S; from Qo. 
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For the case under consideration C must cross each algebraic sheet S, 
(1 S v < ») an odd number of times and an algebraic sheet S, (v = ») an even 
number of times. Consequently a sheet S, (1 S v < ») appears in the sequence 
{7;} an odd number of times, say 2m + 1. In m of these regions which are the 
sheet S,, C enters S, from S,,; so that (11) is in force, while in the remaining 
m+ 1, C enters S, from S,_; , or Qo if »% = 1, so that (10) is in force. Hence, 
if we consider the terms of the deleted sum for which the region 7’; is an alge- 
braic sheet S, (1 S »v < ») corresponding to each term with a negative sign, 
there will be a term numerically equal with a positive sign. In addition there 
will be just one remaining positive term. Consequently the terms of the 
deleted sum for which the 7; are algebraic sheets S, (1 S v < ») add up to 


= 
oe A do, 
v=l 
where G, is the geodesic line in S, through the branch point over A, and the 
branch point over A,4:. 

For the terms of the deleted sum for which 7’; is an algebraic sheet S, (v = v0), 
since each such S, is crossed by C an even number of times, the discussion is 
exactly the same as that of the deleted sum in Case I. Therefore the sum of 
those terms is zero, and we have 


Y' (+ «) - = [ «. 


If this is used in (162), we have 


vo—l 
[doz x [ a+ f ray 


But when P lies in S,, the path made up of G; , Gz, --- , G,,-1 , g», is precisely 
the path G along which the integral in the right member of (7) is extended, 
and with this (7) has been proved in the present case. 

There remains for consideration 

Case III. P lies in S,, and C enters S,, from S,,4:. Here (13) is satisfied. 
In the notation of the preceding case, g, is g,,,andG,isG,,. (16) then becomes 


(163) [a = Y (+ «) _ [ do + [ de 


In this case C crosses each algebraic sheet S, (1 S » S ») an odd number of 
times and an algebraic sheet S, (v > ») an even number of times. The dis- 
cussion of the deleted sum is exactly the same as that of Case II, with the 
modification that here the sum of the integrals extends from »v = 1 to vy = »% 
instead of from »y = 1 tov = » — 1 asin Case II. Consequently (163) becomes 


[az 4 do — f do +f a= F [dot fi ae 


v=! 
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But again this is precisely inequality (7), and with this the proof of (7) is com- 
plete in all cases. 

The proof of (7) establishes the fact that the path G through A, , the branch 
point over A; , and any point P of the surface is the geodesic line on the surface 
through A; and P. 


8. Distance from the branch point over A, to the branch point over A,,. 
From the result of the preceding section, it follows that the distance from the 
branch point over A; to the branch point over A,, is given by 


vo—l1 
nA, 4.)« ¥ [ de. 
v=1 Gy 
But in the notation given by (1) and (2), 
eS Dy Cyst 
] = [ = ] = = é 
[ do Le og (1 + ®) log “ 


¥ 


and we have 


vg—1 
(17) D(A;, A,,.) = ¥ log % = log @ 
v=l1 Cy Po 


9. Lengths of non-Euclidean circles in the various sheets. Consider the 
locus Ko, in the logarithmic ends, of all points P of W which are at the non- 
Euclidean distance p from the branch point over A,. If P is given by w = re”, 
from (5) this locus is given by | log (w/A1) | = p. This curve on W lies over 
the curve 


r= A,exp[+(p — 6°)'] 


of the w-plane. If Ko is the branch of this curve given by the positive sign 
yl . = . . . 
and Ko that by the negative, the non-Euclidean length of Ko , is given by 


7 dé 


that is, 
(18) Lo(p) = 2rp. 


Now consider the locus K, , in the algebraic sheet S, , of points P which are 
at a non-Euclidean distance p from the branch point over A,. If (r, 6) are 
the codrdinates of P, where the pole is taken at the branch point over A, , it is 
found from (6) that this locus is given by r = c,(e’ — 1). Let L,(p) be the 
non-Euclidean length of K,. Then 


(19) Lp) = (1 — e *) [ dé = 2nr(1 —e *). 
0 
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10. The regions W, on the surface and the function L(p). Let W, be that 
region of the surface W which consists of those points of W whose N.E.-distance 
from the branch point over A; does not exceed the positive number p. Let I, 
be the boundary of W,. TI, is obtained by measuring out a distance p along 
all geodesic lines on W through A,. If in this process a branch point is en- 
countered, the measuring must be continued out along all geodesic lines through 
that branch point in all sheets intertwined at that point until the complete 
distance p is attained. It follows from (17) and the result of §7 that if 
log (c,,/Po) < p < log (¢,,4:/po), T, consists of » + 1 N.E.-circles. First there is 
Ky in the logarithmic ends with N.E.-radius p; then for each 1 S v S vw, there 
is K, in S, with N.E.-radius p — log (c,/po). Therefore, if L(p) is the length 
of the boundary of W, , from (18) and (19) 


(20) L(p) = 2rp + > 2r [ — exp (-. + log “) 
v=1 0 


It is convenient to represent the summation in (20) by a Stieltjes integral. 
Let n(t) be the number of algebraic branch points of the surface at a N.E.- 
distance less than ¢t from A,. n(t) is a step-function with discontinuities at 
the points ¢ = log (c,/po) (v = 1, 2,3, ---). At each of these points the saltus 
is +1: 

n(t) = », log (c,/po) < t S log (€,41/po) (» = 1,23, .---). 


The Stieltjes integral 
2r [ (1 — e& °*) dnt) 
“0 
exists and is equal to the summation appearing in (20), where log (c,,/po) < 


p & log (¢y,+1/po). 
The expression for L(p) then becomes 


L(p) = 2x E + | (a-—-e’) an(d |, 
0 
and after an integration by parts 


(21) L(p) = 2x E + | n(t)e eat]. 


11. Sufficient conditions that the surface be parabolic. In order to obtain 
sufficient conditions that W be parabolic, the following lemma will be useful: 


LemMa I. Jf f(x) and g(x) are two positive monotone increasing functions of 
the real variable x defined for all x > 0, and if g(x) = O(x) as x becomes infinite, 


the integrals 


7 dx ad dx 
) 
| f@) +o) “™ J a =e 


diverge together. 
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It is at once evident that the second of these integrals diverges if the first 
does. In order to prove that the first diverges whenever the second does, we 
first suppose there exist positive constants 2 and m such that f(z)/g(z) > m 
forz > 2%. Then 


1 > m 1 
f(z) + g(x) © m+1 f(z) 
for x > 2, and from this the result follows. 

If this condition on f(x)/g(z) is not met, there exists an infinite sequence 
‘r,}, where z, > © as n — ©, such that f(z,)/g(z.) <n for each positive 
integer n. But then, due to the monotone character of f(x) and g(z), it follows 
that 


7 dz > _*_,_% 
ten f(z) + g(x) © n+1 2gG(an)” 
Since g(x) = O(z), the right member of (22) remains greater than some positive 
constant independent of n. But (22) then contradicts the Cauchy condition 
for the convergence of the first integral in the lemma. With this the proof of 
the lemma is complete. 

A sufficient condition that W be parabolic can now be obtained. Since n(t) 
is a monotone increasing function of t, from (21) it follows that 


(22) 


L(p) < 2x le + n(p) "jpg at] < 2rlp + n(p)] 


and 
ben es 
L(p) © 2x[p + n(p)] 
Hence the integral 
/ a will diverge if / are diverges. 


By Lemma I we can then say that 


“dp .. ‘ F dp 

——— diverges if —~ does. 
Lip) * n(p) 

On the other hand, from (21) we also have 


L(p) > 2x E + n(4p) [ eat] = 2nlp + n(}p)(1 — e”)). 


Hence, for sufficiently large p, 


] ] 
L(p) 7 x(2p + n($p)]’ 
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from which it follows that 


"x or" . dp 
if / CP diver es, so also does / ——_—____ , 
L(p) OS Ep + 0) 


From Lemma I it can then be concluded that 


* dp .. , [ dp 
diverges if — does. 
J ne) L(p) 

From the criterion of Ahlfors stated in the introduction we now have 

THEOREM I. Let W be an open simply-connected Riemann surface of the class 
defined in §2. <A sufficient condition that W be parabolic is the divergence of the 
integral 

© dp 
n(p)” 

Since it has been proved that the divergence of the integral of Theorem I is 
entirely equivalent to the divergence of the above integral, the result stated in 
Theorem I is the best which can be obtained from the metric considered. 

A test in terms of the divergence of a series will be useful. Let 

py = log (c,/po) (» = 1,2, 3, --- ). 


By (1) and (2) c, = po, so that p, = 0. Moreover, n(p) = 1,0 <p Spm. 
Therefore 


i. ray Saad fr sy" 


After an integration of the integral on the right by parts, we have 


Pro dp p Pro 1 
(23) i = fe _ [pa] I. 
0 ne) m—1 J, ” Ln) 
The function 1/n(p) is a step-function with discontinuities at the points p, 
(» = 1, 2,3,---). The saltus at p, is »’ — (» — 1)". Consequently 


Pro 1 vo—l p» 
d =-— ——= ; 
I : EA 2 ie — 1) 
and (23) becomes 


Pro d "oO p 
(24 | p — Py "0 . 
0 Nn(p) 2 v(v — 1) + Yo 
Since each p, (v = 2) is positive, it is immediate from (24) that if the series 


2 


Pr 
y=2 v(v — 1) 
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diverges, the integral 
© d 
n(p) 
will also. 

On the other hand, if the integral diverges, the quantity on the right side 
of (24) must become infinite with increasing ». But, since the summation 
appearing in (24) is a monotone increasing function of » , it can be concluded 
that at least one of the terms on the right must become infinite. 

Now, if p,,/v — © as % — ~, corresponding to any constant M > 0, there 
is a positive number N > 2 such that p,,/v > M for » > N, and hence 


Pe M 


f N. 
vo(vo — 1) d vy — | aside 


But then the series diverges. 

If p,,/vo does not become infinite with increasing vp , the sum of the series to 
vo terms must; that is, the series diverges. So in any case if the integral diverges, 
the series does also. 

With this it has been proved that the divergence of the series is equivalent 
to the divergence of the integral. But this series is comparable to the series 

2 

> %. 

y= v* 
Hence we have 

THeoreM II. A sufficient condition that W be of parabolic type is the divergence 
of the series 


From Theorem II there can be obtained some information concerning the 
asymptotic behavior of the quantities | A, | which will guarantee that W be 
parabolic. In the first place it will be shown that if | A,| = O(v"), where n 
is a positive integer, the series of Theorem II converges. In that case there 
a positive constant M such that | A,| < M>" for all vy. Then 


_— 
7p 


oT. 


YC=Ppttmat:--+ pa 
= 2/Ai:|+---+2|/Au1|+/|A,| < 2M(1" + 2" +.--- +2"). 


But, the sum of the n-th powers of the first v integers is a polynomial of degree 

° j n+l . os 

n+ 1 inv, so that c,/pyo = O(v""’) and log (c,/po) = O (log v). From this it 
wo 

follows that the series of Theorem II is dominated by the series > kv ® log », 
vel 

where k is a positive constant. This latter series converges and hence the 

series of Theorem II does also. In other words, in order to have the present 

conditions for the parabolic case met, the points A, over which the algebraic 
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branch points of the surface lie must spread out more rapidly than any 


power of v. 
On the other hand, it will now be shown that if 


|A,|~ \ ex | +t | 
| ™ Tog @ + 1) PL log @ + 1) J’ 


the series of Theorem II diverges. In the first place, it is clear that the deriva- 
tive of (x + 1)/log (2 + 1) is asymptotically equal to 1/log (x + 1). From 


this we have 
+ 1 y+i1 
A,|~D,| —* , 
ws E (v+  fex E (vy + 5 | 


Therefore for any positive constant m’ < 1, there exists an integer » > 1 
such that 
+1 v+1 
| A, | a x 
Aol > ON St hl lies DD 
Cy 


oe lA +--- +2| Aa] +4) 
Po Po 


, k+1 | | k+1 | 
die es 2, pe ©*P | jog (k + 1) |’ 


where K = (2/po)(| A1| + --- + | A,, |) and m = 2m’/po. 
If f(z) is a positive monotone increasing function defined for x > 0, 


Ss) =f se) ae. 


Then 


k=vot+l1 
Therefore 
c v 
“> M + mexp 
Po log v 
where 


vw + 1 
log (ve + 1) 


But then there is an integer vo > vo such that for vy > v 


M = K — mexp | | a constant. 


=> _ ex] ; 
— m exp , 
Po 2 log v 
. _ ts 
Hence the series of Theorem II for » > vo is term by term greater than the 
series 


x (, log v + » log 2 m). 
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But this series is comparable to a divergent series. Therefore the series of 
Theorem II diverges and the surface is parabolic. 

From the above discussion it is clear that in order to obtain this result it is 
only necessary that there exist a positive constant m and a positive integer » 
such that for »v > v 


m yv+1 
| A,| > log (v + 1) _ E= + 5}: 


This result is of the nature of a comparison test. The above facts will be 
stated as 


TuHeoreM III. Jn order that the conditions for the parabolic case expressed in 
Theorems I and II be met, | A, | must become infinite more rapidly than any power 
of v. But, if there exist a positive constant m and a positive integer vo such that 
forv > w% 


, m y+i1 
41> Gan E + mI 


the surface W is of parabolic type.’ 
Other series the divergence of which is equivalent to the divergence of the 
series of Theorem II can be obtained. Two such series are 


pe. log (1 + p,/e,) and “la + p,/e)” — 1). 
y=l v=l 


12. The Gamma surface. Let W be the open simply-connected Riemann 
surface on which the finite z-plane is mapped by 

1 
D(z)’ 


where I'(z) is the Gamma function defined by the Weierstrass product 


1 _ yz = 4 —2z/n 
ty = "(TE (1 4+ 2)e"], 


y being the Euler constant. 

This surface has a logarithmic branch point over w = 0 along with an infinite 
number of smooth sheets, and a transcendental singularity over w = © which 
is a limit point of algebraic branch points of first order. There are no other 
singularities.” 


w = w(z) = 


7 See footnote 3. 

* For a discussion of these facts concerning the general structure of this surface see 
A. A. Utzinger, Die reellen Ziige der Riemann’schen Zetafunktion, Inauguraldissertation, 
Ziirich (1934). In section III of this paper the surface defined by w = ['(z) is discussed, 
and the above facts follow at once. 
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The algebraic branch points of the surface lie over the points of the w-plane 
corresponding to the zeros of 


_r@_ ¥@ 
re) ~ — T@)’ 
where ¥(z) = D, log ['(z). w’(z) has a removable singularity at each of the 
poles of I'(z) and when properly defined is not zero at these points.” The 
zeros of w’(z) coincide with the zeros of y(z). 
The zeros of ¥(z) are all simple and lie on the real axis. There is a single 
positive zero: a; = 1.4616---. There is in each of the intervals (— v + 1, 
—y + 2) (v = 2,3, 4, --- ) one zero a,. We can write” 


a, = —y+1+h,, 


w'(z) = 


where 
be = ey > and lim 4, = 0. 
If we use the Euler formula ['(z)'(1 — z) = 2/sin rz, we get 
. v—1 
etn) = a Ee SE ema sé 
T T 
Since 
. sin wh, _ 
=a 
\we) | ~aro — a) ~ 
log v 
Therefore lim | w(a,)| = ~. 


If we let A, = w(a,), the points of the set {A,} lie on the real axis of the 
w-plane since the Gamma function is real for real argument. Moreover, A, > 0 
fory odd and < Oforveven. The set {A,} constitutes the points of the w-plane 
over which the algebraic branch points of the surface lie. Since each of the 
algebraic branch points is of first order, the Gamma surface belongs to the 
class defined in §2. We already had above that 


(25) 1A, | ~ Ee) | 
log v 


If m and M are any two constants such that 0 < m < 1 < M, there exists 
an integer v; > 4such that for » > »; 


r'(v — h,) 
m 
log v 


(v i h,) 


<|4,.|<M! 
log v 


* See, e.g., N. Nielsen, Handbuch der Theorie der Gammafunktion, 1906. 
1° This result is due to Hermite, Journal fiir Mathematik, vol. 90(1881), pp. 332-338. 
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Also, since the h,’s tend ito zero with increasing v, there exists a positive integer 
"1 such that forvy > »;, h, < 4. Then if » is an integer greater than both 
vy, and »;, fory > » 


2 | Ay | aa see a 2 | es | K 2M mat log a Th oa k) 
| A, | al § fg m > log (1 + k) T'(v — 9)’ 


where K = 2|A,| + --- + 2|A,, |, a constant. Now, for» > 2 


(26) 


a 2 a log v (ny + k) a a log v rv — 1) log v T (v — 2) 
1 log(i+k) Tw—4) log@—1) Tw— 4) log — 2) “Te v—}) 
log v _T@ - 3) aH" log v Th +h) 


log(v— 3) Two—}%)' & log(itkh Tw—}’ 
ro —-3)=0-He-H)e-DIO-D) 


and for each set of values satisfying 1 S$ k S v — », — 4 


r(y + k) <1. 
rv — $) 
Therefore 
log v Tmt k) log v 


log(in+k) TO-%) %-HO-HO-D 
Moreover, there is a positive integer v2 such that for v > ve 
log v log v log v 9 
log(v—1) log(v—2) = log (v — 3) ; 
Hence, if » is an integer greater than both 2», and », for v > 
mh log v : T'(y, + k) 6ry — 1) (v — » — 4) logy 
i=1 log(u +k) Tw-—3) Te-}) @-DHO-HO-D 


But each term on the right tends toward zero as vy — «. Therefore, since 
K/\ A, | also approaches zero with increasing v, we have from (26) that 


i 2|Ai|+- *+2|Aa|_ 
im 


yee | A, 


By definition, c, = 2| A:| + --- +2|A,1| +|A,|. Then from the limit 


above 


and Fi A, poe 
and 
” I'(v — h,) 
‘ logy —~ 


But then 
log (c,/po) ~ log T'(v — h,) — log log »v. 
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By the Stirling formula,” 


log I'(v — hy) = (v — hy — 4) log (v — h,) — » + hy + log (2)! + ae x’ 
where 0 S 6 <1. From this it follows that 
(27) log (c,/po) ~ v log v. 
The function n(p) is expressed as follows: 
n(p) = », log (c,/po) < p & log (¢,41/po) (»= 1,23, .--). 


From (27), log (c,/po) = (1 + 1#,)v log v, where tj, ~ 0 asv— «. Therefore 
(1 + t,)n(p) log n(p) < p S (1 + t41)[n(p) + 1] log [n(p) + 1], 


and 


(1 + #,) log n(p) 
log (1 + tir) + log [n(p) + 1] + log log [n(p) + 1] 


p/logp — (1 + tai)L1 + 1/n(p)] log [n(p) + 1] 
n(p) log (1 + t,) + log n(p) + log log n(p)’ 


The two end members of this inequality tend toward 1 as p — «, and hence 


p 


(28) n(p) ~ ry 
og p 


From (28) it follows that the integral of Theorem I is comparable to 
x 
log pd : : ‘ he ; 
| BPC? which diverges. Consequently the condition for the parabolic 
p 
case expressed in Theorem I is met by the Gamma surface. From (27) it 
io) 
, —_ ; , logy _,. 
follows that the series of Theorem II is comparable to the series p B which 
v=l Vv 
is divergent, so that the condition of Theorem II is also met by the Gamma 
surface. 
The same is true of Theorem III; for, from (25) 


|A, | lim 


ro y+] / a pe y+ 
wp] +8, | log @ + 1) me 


Hence for »v sufficiently large 


m (vy + 1) 
| A, | *X -|, 
i? ween ten] 


where m is any positive constant, and so by Theorem III the surface is parabolic. 


Tue Rice INstTiruTe. 


"See Nielsen, loc. cit., p. 91. 








SOME PROPERTIES OF POLYNOMIAL SETS OF TYPE ZERO 
By I. M. SHEFFER 


1. Introduction. Pincherle,’ in his study of the difference equation 


k 

LD cnd(x + he) = f(x), 

p— 
was led to consider a set of Appell polynomials, in infinite series of which solutions 
could be represented. We considered the same equation’ by means of a different 
Appell set, the change resulting in a significant alteration of the regions of con- 
vergence (for the series). This permitted an enlargement of the class of fune- 
tions f(x) for which a solution could be shown to exist. Recently we treated the 
more general equation’ (linear differential equation of infinite order) 


Lily] = ay + ay’ + --- = f(z), 


where, under suitable conditions on L and f, a solution was found. Here, too, it 
was possible to relate the equation to a corresponding problem of expanding 
functions in series of Appell polynomials. It is this close relation to functional 
equations that adds interest to the study of Appell sets. 

As is well known, Appell sets {P,(x)} (n = 0, 1, ---) are characterized by 
either of the equivalent conditions 


(1.1) Pi(z) = Prx(z) (Pa polynomial of degree n); 
(1.2) A(te* = 2 P,(a)t’, 
0 


where A(t) ~ >> a,t” is a formal power series, and where the product on the left 
of (1.2) is formally expanded in a power series in accordance with the Cauchy 
rule. We shall say that the series A(t) is the determining series for the set {P,|. 


For the particular equation 
y(z + 1) — y(x) = f(z), 


Pincherle used the Appell set with A(t) = 1/(e‘ — 1), getting essentially the 
Bernoulli polynomials. We used A(t) = e‘ — 1, so that n!P,(z) = (2 + 1)"- 
z". Now this equation is also associated with the important set of Newton 
polynomials 


Received February 6, 1939; in preliminary form this paper was presented to the American 
Mathematical Society, April, 1936 under the title A characterization of a class of polynomials. 

' Acta Mathematica, vol. 48(1926), pp. 279-304. 

? Trans. Amer. Math. Soc., vol. 39(1936), pp. 345-379, and vol. 41(1937), pp. 153-159. 

* This Journal, vol. 3(1937), pp. 593-609. 
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a(t —1)--- (x —n +1) 


n! 


(1.3) N(x) = 1, N,(2) = (n = 1, 2, --+), 
which is not an Appell set. Yet, it has properties analogous to those ((1.1) and 
(1.2)) of Appell polynomials. In fact, 


(1.4) AN, (x) = N(x + 1) — N(x) = Na-r(2), 
(1.5) (1 + t)” = er me Ate) _ > N,(2)t". 
0 


It is thus suggested that we define a class of difference polynomial sets, of which 
|N,} is a particular set, by means of the relations 


(1.6) AP, (x) = Pra(z) (n = 0, 1, 2, ---). 


And more generally, we can use other operators than d/dz and A, to define 
further sets. We thus obtain all polynomial sets of type zero (as we denote 
them). The definition of sets of type zero generalizes readily to give sets of 
type one, two, --- , and of infinite type. (This is done immediately after rela- 
tion (1.15).) This hierarchy of types is all-inclusive, in that every set of poly- 
nomials is of a definite type. 

The main purpose of this paper is to bring to attention these sets of type zero 
and to indicate some of their properties. This section considers sets in general. 
§2 obtains various characterizations of zero type sets. Then, in §3, a study is 
made of the conditions on a set of zero type in order that it satisfy certain 
functional equations of finite order. As there are some known Tchebycheff 
sets that are of type zero, we next (§4) determine all zero type sets that are 
Tchebycheff sets. Lastly, in §5, we examine some extensions of the definition 
of type to type of higher order. 

By a set of polynomials {P,(x)} (n = 0, 1, 2, --- ) we shall mean a sequence 
in which each P,, is of degree exactly n. We shall denote the set {P,} by 2’. 


Lemma 1.1. Let J be a linear operator applicable to the functions x" (n = 


0, 1,---) (and hence to all polynomials) and such that J[{x"| is a polynomial of 
degree not exceeding n. Then J has the form 
(1.7) Jly(x)] = > L,(x)y’ (x), 

0 


valid for all polynomials, where L,,(x) is a polynomial of degree not exceeding n. 


To see this, define the L,(x) recurrently by the relations 
(1.8) J{z"| = p L.(x)-n(n — 1) «++ (n —k + 1)a"™ (n = 0,1, ---). 
k=0 


Since the degree of J[x"] does not exceed n, the degree of the L,’s are seen not to 
exceed their index. By construction, (1.7) now holds for y(z) = 2", and there- 


fore for all polynomials. 
Of special interest is the case where J[x"] is always of degree n — 1. 
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LemMa 1.2. In order that the operator (1.7) carry every polynomial into one 
whose degree is less by precisely‘ one, it is necessary and sufficient that 


(1.9) Lo(x) = 0, L,(z) = Lao + Lax + _— + Low (n = S 2, eee ) 
and 
(1.10) An = Nh + n(n — L)ln + --- + nblani # 0 (n = 1,2,---), 


First suppose that J[1] = 0 and J[z”] is a polynomial of degree n — | 
(n = 1, 2,---). From (1.8) we find that the coefficients of x" and 2” in 
J[x"] are respectively 


loo + nly + n(n — I)len + --- + nan, An (n = 0,1, ---). 


Taking n = 0, 1, --- successively, we see that l;; = 0 (¢ = 0, 1, --- ), so that 
L,(2) is of degree less than n; and in order that L[x”] be of degree exactly n — 1, 
it is necessary that A, ~ 0. The conditions are thus necessary, and it is readily 
seen that they are also sufficient. 

We shall assume without further mention that the operators with which we 
deal are of type (1.8) and that they fulfill the conditions of Lemma 1.2, so that 
they have the form 


oo 


(1.11) Thy) = Do lao + +++ + Inman” )y (2) 
with A, ~ 0 (n = 1, 2,--- ). 
THEOREM 1.1. Let P: {P,(x)} be a given set. There is a unique operator J 
for which 
(1.12) J([P.] = Pai (n = 0,1,---). 


If y(z) = P, (n = 1, 2, --- ) is substituted into (1.11), it is found that the 
l;;’s exist to make (1.12) true and are uniquely determined. This is the assertion 
of the theorem. If P satisfies (1.12) we shall say that P corresponds to the 
operator J. Conversely, we have 

THEOREM 1.2. To each operator J correspond infinitely many sets P for which 
(1.12) holds. In particular, one and only one of these sets (which we call the 
basic set and denote by {B,}) is such that’ 


(1.13) B(x) = 1; B,(0) = 0, n> 0. 


If Q is any polynomial of degree s, it is found by direct substitution that a 
polynomial P exists, unique to within an additive constant, such that J[P] = Q; 


‘ It is understood that J[c] = 0 for every constant c. 

5 The set {B,} is the “best approximation” set relative to the sequence of operators 
J°, J', J*, «++ according to the definition in the American Journal of Mathematics, vol. 
57(1935), especially p. 593. 
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and P is of degree s + 1. Choosing Bo(x) = 1, we can then successively (and 
uniquely) determine B,, B;, --- to satisfy J[B,] = Bn, B,(0) = 0 (n > 0). 
Moreover, B, is of degree exactly n. We thus have the existence of the basic 
se. That infinitely many sets exist is a consequence of the additive constant 
that is arbitrary. In fact, we have 


CoROLLARY 1.1. A necessary and sufficient condition that P be a set corre- 
sponding to J is that there exist a sequence of numbers {a,} such that 


(1.14) P,(x) = aoB, (xr) + a Bri(z) + --- + anBo(z) (a9 * 0). 
Here the B,’s form the basic set for J. 
If P satisfies (1.14), then P, is of degree n, so that Pisa set. Again, 


J[P,] = > a;J(B,-] = y a;Bn-i- ; 


so that J[P,] = P,1. This proves the sufficiency. 
To establish the necessity, we first observe that constants {a,;} exist so that 


P,(z) = anoB, (x) + --- + GnnBo(z). 
From the relation J[P,] = P,_; it follows that 
An0Bra +--+ + Ann1Bo = Gn10Bni + +--+ + Qn-t.n1Bo, 
so that 
Qs * Gets (j = 0,1,---,n—1). 
For a fixed j, this says that for every n 2 J, all a,;’s with second index j are 
equal. It is therefore permissible to drop the first index of each a,; ; which 


means that {a;} exists so that P, is given by (1.14). 
It can likewise be shown that 


Coro.tuary 1.2. Jf P is a set corresponding to J, then a necessary and suffi- 
cient condition that {Q,} correspond to J is that constants \b,} exist so that 


(1.15) Qn = bP an + OyPai +--+ + OnPo. 

DeFiniTIon. Let J be the (unique) operator corresponding to a given set P. 
P is of type k if in (1.11) no coefficient L,(x) is of degree exceeding k, but at 
least one is of degree k. If the degrees of the coefficients L,(x) are unbounded, 
then P is of infinite type. 

From Theorem 1.2 follows 


Coro.uary 1.3. There are infinitely many sets for every type ( finite or infinite). 


It is of interest to ask under what alterations, either of the set P itself or of 
the operator J which defines the type of P, the type is preserved. We consider 
two simple cases. 

(i) Suppose P, is replaced by c,P,, where c, # 0 (n = 0,1,---). Sucha 
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transformation does not affect convergence properties of series in these poly- 
nomials, but it can very well change the type, as is readily established. 
(ii) Let the operator 


(1.16) Kly] = ky’ + key” + --- (ki # 0) 


be given. The following can be shown (analogously to Theorem 1.1): Jf P is 
any set, there exists a unique operator J x of form 


(1.17) J xly] _ y » (no + Lax + Se + lnnax” )K"[y] 
n=l 


(where K" means K[|K"‘[y]]), such that 
(1.18) J x(P,] = Pont . 
Moreover, 


(1.19) An = nlok; + n(n — Ilaki + --- + nanak? #0 (n = 1,2,---). 


There is also an analogue to Theorem 1.2. 

Now suppose that we define the type of a set by the degrees of the polynomials 
L,(xz) in (1.17). It is seen that no matter what operator K (of form (1.16)) is 
used, the type of P is the same. 


2. On sets of type zero. Especially simple and important are sets of type 
zero. We shall find several characterizations for such sets. It will be con- 
venient to restate the condition for a set of type zero as follows: P is of type 
zero uf 


(2.1) J(P,] - Po (n = 0, l, 2, ‘eo ); 
where 
(2.2) Sly) = cry’ + ery” + cay!” + --- (c, # 0). 


DEFINITION. The formal series 
(2.3) J(t) ~ et + cof? + of’ + --- 


will be called the generating series (or function) for the operator (2.2). 

That Appell sets are of type zero follows from the fact that the generating 
series is J(t) = t. Similarly, for Newton polynomials (and for all the difference 
sets—cf. (1.6)), J(t) = e' — 1. 

Let P be of type zero, corresponding to the operator J. Let the formal 
power series inverse of (2.3) be 


(2.4) H(t) x st + Sf + --- (s, = c;' ¥ 0), 
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obtained from® 
(2.5) J(H(t)) ~ H(J()) =. 


If e is raised to the power zH(t), the expression will have a formal power 
series expansion in t, in which the coefficient of t” involves only 8, --- , 8, . 


On multiplying by the formal power series’ 
(2.6) A(t) = )) ant" (ao ¥ 0), 
0 


a new series (in ¢) is obtained, in which the coefficient of t” now involves only 
&,+++,4nj%8,-+-,8n. In fact, this coefficient is a polynomial in z of degree 
n, and we have, furthermore, 


THEOREM 2.1. A necessary and sufficient condition that P be of type zero 
corresponding to the operator J of (2.2) is that \a,} exist so that 


(2.7) A (te = px P,,(x)t". 
n=0 


From (1.14) of Corollary 1.1 it is seen that both the necessary and sufficient 
parts will follow if we can show that for the basic set | B,} (corresponding to J) 
we have 


(2.8) eM ~ > B,(2)t". 


Let exp {xH(t)} have the expansion >> C,(x)t". Then C,(z) is a polynomial 
of degree exactly n. On setting z = 0, we obtain 1 ~ >> C,(0)t", so that 
C0) = Co(x) = 1, C,(0) = 0 (n > 0). By Theorem 1.2, {C,} will therefore 
be the basic set if we establish the relation J[C,] = Cr (n = 0, 1,--- ). 
Operate on the C,(z)-series with J. This gives 


Zz J(C,]t" = Jlexp {xH(t)}] ~ lal + eH*® + -.-- }-exp {rH} 
0 


= J(H(t))-exp {rH} = t-exp {rH} = ZZ. Cr-i(x)t"; 


‘If the series for J(t) is formally substituted for ¢ in (2.4), and coefficients combined 
(in the usual way) to form a single power series in ¢, the coefficient of ¢" is for each na 
polynomial in c;, C2, --* ,€n, 81, °**,8n. It is possible to choose s, recurrently and 
uniquely as a simple function of c; , «++ , Cn, 81, *** » 8a-1, 80 that the power series reduces 
to the single term ¢. This sequence of s;’s is the one to be used in (2.4). 

’ The condition ap ~ 0 is to insure that P,(z) in (2.7) is of degree n and not less. But 
% ~ (0 is no essential restriction. See, for example, the footnote on page 916 of Bull. 
Amer. Math. Soc., vol. 41(1935). 
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and on comparing like powers of ¢t, we obtain J[C,] = Cn».. Thus, (2.8), 
holds.” 


Coro.uary 2.1. Jn (2.7) the numbers {a,} are the same as in (1.14). 


Thus, for Appell sets, (2.7) holds with H(t) = ¢ (ef. (1.2)), and for difference 
sets (including the Newton polynomials), H(t) = log (1 + #), so that a necessary 
and sufficient condition for a difference set P is that 


(2.9) A()(1 + =)” = Dd P,(x)t". 
Another familiar set of polynomials of type zero is given by the Laguerre 


polynomials which satisfy the relation 


1 —tz n 
(2.10) pay ‘ = } = > L.(z)t*. 


For this set, 
io) 


AW =, : » HO =J = , -De, 


so that 


Lea(z) = -(Le + Le t+Le +-::-). 


® Some words are in order regarding the validity of the above proof, which uses formal 
series; particularly, since like arguments (as well as obvious modifications) will be used 
again. Let k be any positive integer. Consider the operator 


Jily] = exy’ + «++ + xy 
and the generating series 

Je(t) = cyt + +++ + cyt. 
Let H;,(t) be the inverse of J;(t): 

J(Hi(t)) = Hi(Je(t)) = 6, 


and define Cy,(z) by the convergent expansion 


exp {zH,(t)} = > Cen(x)t". 
n=0 


The argument advanced above is now completely legitimate, giving the relations 
JilCen(z)| = Cen-1(z). 


Now it is readily seen that the series for H(t) and for H,(t) agree through the term in 
t*, whence the same is true for the two series for exp {zH(t)} and exp {zH,(t)}. This means 
that Cun(z) = C,(z) (n = 0,1, ---, k). As kis arbitrary, it follows that J[C,] = Cy-. for 
all n. This establishes (2.8) and, therefore, (2.7). 

Having thus shown that the use of formal power series yields the correct result (in the 
above case), we shall not hesitate to use such power series in what follows, leaving the 
argument in the present footnote as a guide to further ‘‘validity proofs’. 
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Every set satisfies infinitely many linear functional equations. One of the 
simplest for sets of zero type is given by 


THEOREM 2.2. Let P be of type zero corresponding to operator J, and let A(t) 


be its determining series. Then P satisfies the equation 
(2.11) . Lly(x)] = »» (qo + xqu)J‘ly] = dy, 
where’ = nfory = P,(x). The q’s are defined by 

A) oF , 

(2.12) A (t) = > Gn+i,0t ’ 

(2.13) H'(t)= > Qn+iit”. 


Suppose each side of (2.11) (with y = P,, \ = n) is multiplied by ¢” and a 
summation made from n = 0 ton = o. There result two power series in ¢. 
(2.11) will be established if we show that these series are formally equal. Now 
the right-hand series is 


t >> nP,t” = 1S {Ae™"} = te™"{A’ + xH’A}. 


Also, 
> J"[P.lt” =~ t* >> P,t” = Ae, 


so that the left-hand series is 


DL (quo + 29) J"[Palt” = »» (qo + xqu)t* Ae™, 


n=0 k=l 


and if we use (2.12) and (2.13), this becomes 
tH A’ | 
Ae (! A + 2tH f 


Hence, the two series are equal, and (2.11) holds. 

Since (2.11) is linear and homogeneous, multiplication of a solution by a 
constant again yields a solution. But such multiplication may destroy the 
property of being a zero type set. We cannot therefore obtain a complete 
converse to Theorem 2.2. But we do have 


Coro.uary 2.2. Given an operator J. If a set P satisfies an equation of form 
(2.11), where \ = n for y = P,(x), and if {qu} is related to J by (2.13), then 
non-zero constants {h,} exist so that {|h,P,} is a set of type zero corresponding to J. 
Its determining series A(t) is then given by (2.12). 


For, define A(t) by (2.12), the arbitrary multiplicative constant which enters 
being given any non-zero value. By Theorem 2.2, the set {2,}, of type zero, 
corresponding to J and with determining series A (¢), satisfies (2.11). Now, it is 
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readily found that for \ = n equation (2.11) has a polynomial solution, and 
that this polynomial is unique to within an arbitrary multiplicative constant. 
Hence, {h,} exists so that R, = AP, . 

As a characterization of sets of type zero, Theorem 2.2 is not wholly satis- 
factory, since it involves the operator J of the set. This objection is removed in 


THEOREM 2.3. If P is of type zero, it satisfies an equation of the form 
(2.14) Mly(z)] = » (rio + array” (x) - dy(x), 
k=1 


where X = n fory = P,. Moreover, the operator J and determining function A 
corresponding to P are related to the r’s as follows: 


(2.15) > rott uA’(u) 
k=1 A(u) 

aie) > rat’ = uH’(u), 
k=1 


where u = J(t). Conversely, if a set P satisfies equation (2.14), then non-zero 
constants |h,} exist such that {h,P,} is of type zero. 


To see this we observe that if in (2.11) we write out each J“[y] as a series of 
derivatives of y and collect all terms with the same order of derivative, then to 
each k there are only a finite number of terms in y“’(x). The result of this 
collecting of terms is to give us the equivalent equation (2.14). 

If in (2.11) and (2.14) we replace each derivative y“’(x) by t“, we obtain of 
course the same formal series (since (2.14) is merely a regrouping of terms in 
(2.11)). That is, 


xz 


p> (rio + aru’ = p> (quo + rqu)J*(t). 
On using (2.12) and (2.13), we obtain (2.15) and (2.16). The converse follows 
as in Corollary 2.2. 

One obtains a generalization by replacing y“’(xz) in M[y] by K“[y], where K 
is an operator of form (1.16). This yields the following theorem (proved as 
was Theorem 2.3): 


THEOREM 2.4. Let operator K be given. If P is a set of type zero, it satisfies 
an equation of form’ 


(2.17) Tly(x)] = D0 (rio + 2ra)K*ly] = Ay(2), 
i=1 
where X = n for y = P,. The operator J and determining function A corre 


sponding to P are related to the r’s as follows: 


* We observe, on comparing the coefficient of z* on both sides of (2.17), that ri; = . 








VS 
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(2.18) > nalK (OF & MA@ | 

k=1 A (u) 

7 u = J(t). 
(2.19) > ralK(i)]* > uH’(u), 


Conversely, if a set P satisfies equation (2.17), then non-zero constants {h,} exist 
so that {|h,P,} ts of type zero. 
From (2.11) of Theorem 2.2 there follows a further characterization of sets 
of type zero, expressed solely in terms of the members of the set itself. It is 
THEOREM 2.5. A necessary and sufficient condition that a set P be of type zero 
is that constants Qo , xi exist so that 


(2.20) D> (qxo + ria) Pr_(2) = nP,(z) (n = 1, 2, -- +). 


The operator J and the determining series A for P are related to the q’s by (2.12) 
and (2.13). 

Let (2.7) be differentiated with respect to z. On equating coefficients of like 
powers of ¢, we obtain 
(2.21) P(x) = s8Pai(z) + 82Pn-o(z) + --- + 8Po(z) (n= 1,2,---), 
whence we have 

THEOREM 2.6. A necessary and sufficient condition that a set P be of type zero 
is that constants {s,} exist for which (2.21) holds; in this case the operator J 
corresponding to P is determined through {s,} by means of (2.4) and (2.5). 

Theorem 2.6 will later be seen to generalize to sets of all types. (Cf. Lemma 
5.1.) It is of interest to compare (2.20) and (2.21). The latter involves only J 
(through 7), so that all zero type sets for one and the same operator satisfy the 
same equation of form (2.21). On the other hand, both J and A are involved 
by the constants present in (2.20), so that if sets {P,} and {Q,} both satisfy 
(2.20), then there is ac # 0 such that Q, = cP, ,n > 1;/.e., there is an essen- 
tially unique set satisfying (2.20) for given gio , gu - 

If (2.7) is differentiated with respect to x, the left side is multiplied by H(d), 
and P’, replaces P,, on the right. Recalling that H begins with a term in t, 
we see that Q, = P',4; is a set of zero type, corresponding to the same operator 
J as does P. In other words, we have 

TuErorEeM 2.7. If P is of type zero, then so are the sets {Pri}, {Pra}, 
(Pr.s}, --- ; and they all correspond to the same operator as does P. More 

A simple extension of Theorem 2.6 is the following: Let P be a set with operator J whose 
inverse is H, and let K be an operator of form (1.16). Set 

K(H(t)) = ait + al? + ++. 
Then 
K[P,(2)]) = a, P,\(2) + a2P,2(x) + oss + a, P(x) (n = 1, 2, see) 
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generally, let K be of form (1.16). Then, if P is of type zero, so are {| K[Py41}}, 
{K*[P2]}, --- , and they correspond to the same operator as does P. 


Let us apply the preceding characterizations to some well-known zero- 
type sets. 
Example 1. P,(xz) = x"/n!. Then 


> Pa(z)t" =e", AQ=1, AWM =J =t. 
0 
It is readily determined that (2.11) and (2.14) become 
aP’, = mP,, 
and (2.20), (2.21) become, respectively, 
aP,1=mP,, Pr = Pa. 


Also, (2.17) holds with ry» = 0 and r_, determined from 
Pe ralK (t)|* =. 
k=1 
Example 2. Laguerre polynomials {L,(z)}. Here 
< — f —2t os _ _ = 
XL La(x)t = ( ns :) exP 4 - } A(t) = 7 J(t) = H(t) = —- 
It is found that (2.11) and (2.14) become 
p> (1 — ka)J*[Ln(x)] = nLp(z) Vly] = —y’ —y” —y’” — --), 


(x = 1)Li —_ rL’. == nLn, 


while (2.20), (2.21) reduce to 


(1 = ke)ena(z) = mbna), Ee (e) = — [Ly ala) + Lala) + +=} 


Most of these relations are known. In (2.17), rio, Ts are determined by the 
series 


c co) 


DS rolKO = -t, DralkK@ =t-?. 


k=l k=l 
Example 3. Hermite polynomials {H,(z)}. Here" 


> H,(z)t" = exp {-¢ + 2tz}, A(t) =e", H(t) = 2t, J(t) =} 


n=O 


1! Jt is more common to define H,(z) so that H,/n! is the coefficient of t". We find it 
simpler to adopt the present definition. 





~n A 


rl}, 


eTo- 


id it 
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Relations (2.11) and (2.14) become 
2cH, — H, = 2nH,, 
and (2.20), (2.21) become 
2tHn+»— 2Hi»2=nH,, Hy = 2H». 


These are also well known. The defining relations for rio, rm in (2.17) are 


oo eo 
9 


YS rolKOl = -#, YSrolKOK =. 


k=1 k=1 


Example 4. Newton Polynomials. Here 


x 


> N,(x)t” = (1 + 0)’, Ajt)=1, H@®=lg(1it+d, J =e 1. 


0 


It is seen that (2.11), (2.14), (2.20), (2.21) become, respectively, 


4 eo) k-1 
x p> (—1)* 'a‘N,(z) = nN,(z), x 2 =< N““ (xz) = nN, (2), 


z >, (-1)*"Na-x(z) = nN,(2), 
k=1 


! = ee ee 
Ni (2) = Nal) — 3 Nas(a) +--+) ma. 
In (2.17), roo = O and ry is determined by 


ow 


p> rulK (i) = 1 — e*. 


3. On sets of zero type satisfying finite order equations. In the Bulletin 
paper (cited in footnote 7) those Appell sets were determined that satisfy a 
finite order linear differential equation with polynomial coefficients. Here we 
extend the problem to the case of a finite order equation in an operator K of 
form (1.16), satisfied by a zero type set P corresponding to the operator J and 
determining function A. We first restrict our attention to equations of 
form (2.17). 


In order that a set P, corresponding to a given J and A, satisfy a finite order 
equation of form (2.17): 


(3.1) Tly(x)] = . (rio + arm) K*[y] = dAy(2), 


with AX = n for y = P,(z), it is necessary and sufficient that the following rela- 
tions hold :” 


2 [t was shown in a footnote to Theorem 2.4 that ri, = ky’. 
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™ 


(3.2) p> riolK(t))* = Fo(t), 
(3.3) > mK (OI! = Fi), 


where s = max (m, q) and Fy, F; are defined by 
(3.4) Fo(t) = {uA’(u)/A(u)}, F(t) = (uH’(u)} (u = J(#). 

Suppose P satisfies (3.1). Since rn = 1/k, ¥ 0, at least one rm is not zero, 
and we let rj be the last non-zero one. On the other hand, all the rjo’s may be 
zero. This is true, as we see from (3.2), if and only if A(t) isa constant. (3.3) 
then gives us 

TuHEoreM 3.1. If P corresponds to a given J and A, with A(t) = c ¥ 0, then 
a necessary and sufficient condition that P satisfy a finite order equation of form 
(3.1) is that F(t) be a polynomial in K(t). 

Now suppose that A # c. Then the ryw’s are not all zero, and we let ray 
be the last non-zero one. Since (3.2) and (3.3) are polynomials having a 
common root K(t), their Sylvester determinant vanishes: 


Tm0 Tm—1,0 = és F(t) 


] TOWS {| ss cccccccccccccccccccccccsesccccees 
(3.5) Tm0 Tm—1,0 ne rio — Fo(t) wal 
Tqi Tq-1,1 rm — Fy(t) ° 
M TOWS4| ccccccccccccccccccscccccssccvececes 
Toi 7q-1,1 eee = F(t) 


(In (3.5), elements that are zero are not indicated.) 

Conversely, suppose (3.5) holds for a choice of the r’s, with rmo # 0, ra # 0, 
and let s = max (m, qg). Then there is a common solution K(¢) of (3.2) and 
(3.3), which solution can be expressed as a formal power series in ¢t. Now 
from (1.16), K must begin with a term in ¢. Suppose the common solution K(é) 
does not have this property. Then let us set K(t) = c + K,i(t), where K,(0) = 0. 
On substituting into (3.2) and (3.3), we obtain two polynomials (on the left) 
with constant terms c, ¢:, say. Now for t = 0, since F,)(0) = F,(0) = 0, 
we get c = c, = 0. Hence, K,(t) is also a common solution, and it is zero 
att = 0. We shall then use K,, or, what amounts to the same thing, we may 
suppose that K(0) = 0 to begin with. If K begins with a term of higher 
degree than one in ¢, then so does the left side of (3.3), and therefore the right 
side. But tH’(t) begins with a term in ¢, as does J; hence, so does F,(t). As 
this is a contradiction, it follows that K(t) begins with a term in ¢, and thus 
the common solution K has the properties of (1.16). Therefore P satisfies 
(3.1). This establishes 

TueoreM 3.2. Let P correspond to J and A, where A # c. A necessary and 
sufficient condition that P satisfy a finite order equation of form (3.1) is that Fo(t) 
and F(t) satisfy (3.5) for some choice of ro, Ta (with rmo ¥ 0, ra ¥ 0). 
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As (3.5) is to be an identity in ¢, it will likewise be an identity in wu where 
u = J(t) (so that t = H(u)). Accordingly we have 

Coro.Luary 3.1. In Theorem 3.2 the functions F(t), Fi(t) may be replaced 
by uA’(u)/A(u), uH’(u). 

Theorem 3.1 can be generalized to 


THEOREM 3.3. If in Theorem 3.1 the condition “A(t) = c # 0” is replaced by 
the condition 
Pp 


(3.6) ‘o(t) = > op[Fi(t)]” (c, ¥ 0), 


k= 
then the same conclusion holds. 


For, K(t) can be defined by (3.3), whereupon F(t) is expressible as a poly- 
nomial in K(t). That is, an equation of form (3.2) holds. Hence, P satis- 
fies (3.1). 

There are infinitely many pairs of functions Fy , F; satisfying a given relation 
(3.5). One can, for example, give one of Fy, F; arbitrarily. This suggests 
examination of the following question: Given one of the three elements J, A, K, 
to what extent are the others determined so that (3.1) holds? 

Case I. Given J(t). This determines H(t) and therefore F(t). In fact, one 
easily finds from (2.5) that 


(3.7) Fy(t) = J(t) + J’. 


DEFINITION. Given a function (or formal series) f(t), beginning with a term 
int. We denote by §{f(é)} the class of all (formal) series z(t), beginning with 
a term in ¢t, satisfying a relation of the form 


(3.8) pz + poe + --- + pre” = f(d), 


the p’s being constants, with p, ~ 0. §{f(t)} thus represents a special class of 
algebraic functions of f(t). 

In terms of this definition, we see that K is determined from (3.3) as a member 
of the class #{F,(t)}. That is, J being given, K must be in §{F,(é)}, but can 
be an arbitrary member of this class. Consider any such K(t). From u = J(t) 
follows t = H(u), so that K(t) = K(H(u)) = K*(u). A(u) is then determined 
by the (necessary and sufficient) condition that uA’(u)/A(u) be a polynomial in 
K*(u). Observing the wide choice possible for K, after which a further wide 
choice for A exists, we see that to each operator J correspond a large variety of 
polynomial sets P satisfying an equation of form (3.1). 

Case Il. Given K(t). Then F(t) is to be a polynomial in A(é), lacking a 
constant term and with coefficient of the linear term equal to ki’. For all 
such F; , we determine J(t) from (3.7). Having now K and J, we obtain A 
as in Case I. 

Case III. Given A(t). Let K*(u) be any member of the class §{uA’(u)/ 
A(u)}. Determine uH’(u) as any polynomial in K*(u) beginning with a linear 
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term, and solve for H(u). Relation (2.5) gives us J(u), and with this J we 
determine K(t) from the identity K(t) = K*(J(#)). 

As illustration, choose K(t) = t, so that (3.1) is a differential equation. From 
Case II we see that 


J’ = a 

J tQ(t)’ 
where Q(t) is any polynomial of form 
(3.9) QQ =1+ht+--- +12, 
and that therefore 

‘1 — Q@) ; 

3.10 J(@) = a- | QO us. 
(3.10) J(t) = et exp { » 100) dt 


The inverse function H can be found from the power series for (3.10) or from 
the differential equation” 


(3.11) uH'(u) = H(u){1 + hH(u) + --- + L,A*(u)}, 
with the condition that H(u) begins with the term ¢/c. And finally, A is ob- 
tained as the solution of the differential equation 


uA'(u) _ - 
A(u) _— bi H(u) + = + bm H (u), 


where the b’s are arbitrary, but b; + 0; i.e., 
(3.12) A(u) = vex { | * [bH(u) +--+. + baw) du}, 
0 


y = arbitrary constant. 

Relations (3.10), (3.11) and (3.12) are thus necessary and sufficient conditions 
that equation (3.1) be satisfied for K(t) = t. 

It has already been remarked that (2.17) is not the only linear functional 
equation satisfied by a set P. In fact, extending a result in the Bulletin paper 
(loc. cit., p. 914), it is easy to show that given an operator K of form (1.16), 
and given any set P (which need not be of type zero), polynomials {L,(z)} 
with L, of degree S n, and characteristic numbers {A,}, can be chosen, and 
indeed in infinitely many ways, so that the set P is a solution of the equation” 


(3.13) Lly(2)| = > Ln(2)K"ly] = dy 


(with X = A, for y = P,). 


13 K(t) =t = K*(u) = K(H(u)). Therefore, H(u) = t, and K*(u) = H(u). (3.11) then 
follows from (Q(t) = uH’(u) (u = J(t)) if we write t = H(u). 

14 It is no restriction to have the summation begin with n = 1, for if a term n = 0 is 
present, it is of the form Loy = cy, and this can be absorbed into the right side. The only 
effect is to alter all the \,’s by the amount —c. 
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An equation (3.13) is said to be of finite order r if L,(x) = 0 for n > r, but 
L, # 0. We now investigate conditions under which a set P of type zero 
satisfies a finite order equation of form (3.13). 


LemMA 3.1. A set P (not necessarily of type zero) cannot satisfy two different 
equations of form (3.13) uf the characteristic numbers are respectively the same. 


For, suppose P satisfies (3.13) and also L*[y] = Ay (whose coefficients and 
characteristic numbers are L*(x) and Ax = Xx). By subtraction, 


Ms 


(Ln — L*)K"[P,] = 0 (s = 1,2, ---). 


n=1 


Now, K"[P.] = 0, n > s, and K"[P,] = constant # 0. Hence, on setting 


s = 1, 2, --- successively, we find that L, — L? = 0,s = 1. The two equa- 
tions, supposedly different, are thus identical. 

If we set 
(3.14) L, (2) = Ino + +++ + Int”, 


the characteristic numbers A, of (3.13) are given by 
(3.15) An = nkily + n(n — 1)kilee + --- + nbkTlan (n = 0,1, --- ). 


This is seen on equating the coefficient of x” on both sides of L[P,] = AP... 
Suppose P is a set of zero type. We know that it satisfies (2.17), which is a 
particular case of (3.13): 


(3.16) T (Px) = Do Su(x)K*|Ps] = nPo, 
k=1 

where 

(3.17) Su(z) = reo + 2ru, ru = 1/k, # 0. 


Now define operators 7; by 
(3.18) Tr ly] = T(T — 1)(T — 2)--- (T — k& + ly). 


TuroreM 3.4. If the zero type set P satisfies (3.13), then (3.13) can be ex- 
pressed in the canonical form 


oo 


(3.19) Lly] = Dd an Taly] = dy, 


a= 


where 1', is given by (3.18) and a, by 


(3.20) an, = lan ki . 
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To see this, denote the operator of (3.19) by L*[y]. If in (2.17) K[y] and 
its iterates are replaced by series in derivatives of y, using (1.16), we can write 
Tly] as” 


(a) T ly) = > Qu(x)y (x), 


where Qy, is a polynomial of degree < 1. Iteration gives 


(b) T"ly] = > Qni(x)y (x), 
where Q,, is of degree < max (n, k). From this it follows that 
(c) T aly) = > Rux(x)y (2), 


R,. being of degree S max (n, k). 
° (k) ° ° . ° " 
On replacing each y“’(x) by its equivalent as a series of iterates of Aly), 
16 
(c) becomes 


we 


(d) T ly] = do Snx(x)K* yl, 


k=1 


where S,,4 is a polynomial of degree < max (n, k). Since from (3.16) 7|P,] = 


nP,, , therefore 

(3.21) TAP.) = kk —1)--- (kK —-n+1)Pk, 

and in particular, 

(3.22) T [P| = 0, n>k. 
Using (3.22) in (d) fork = 1, 2,---,n — 1, we find that S,, = 0,k < n, 8 


that (d) can be written 


we 


(3.23) Talyl = DO Sae(x)K* yl. 
k=n 
If we substitute this expression into L*{y] (given by (3.19)) and collect like 
iterates of K, we obtain for L* the form 


L*[y] = > for Siz + +++ + axSee} Kyl. 


That is, L* can be written in the form of (3.13). By Lemma 3.1 it will follow 
that L* and L are identical if we show that the respective characteristic numbers 
are the same. For L the numbers are X, , given by (3.15). From (3.21) we 


16 Since K(t) begins with a term in ¢, and therefore K*(t) with a term in (*, the coefficient 
of any y“)(z) in (a) is obtained from only a finite number of coefficients of (2.17). Hence, 
the coefficients in (a) are well-determined. 

16 The point of the preceding footnote applies here also. 
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see that A* (for L*) is given by 
c 2) 


x = > ax-n(n — 1) ---(n—k+1), 


k=1 
and this is precisely A, . The theorem is thus establisned. 
Coro.uarRyY 3.2. Under the conditions of Theorem 3.4, the coefficients L,(x) of 
(3.13) are given by 
(3.24) L,(2) = a S1n(2) + lied ic + OnSnn(X) (n = i, 2, “s ). 


To justify the phrase “canonical form’, it should be shown that every equa- 
tion of form (3.19) has a solution of zero type. That is, 


THroreM 3.5. Let K be an operator of form (1.16), and let T[{y] be of form 


T ly) = Do (rio + zru)K* ly] 


with ry = 1/k, # 0. Then for every choice of a’s (not all zero), the equation 
L{y] = p> An Ta ly] = Ay 


is satisfied by a zero type set. 

In fact, 7[y] serves to define a zero type set P by virtue of Theorem 2.4 and 
the relation (2.17). This same set P will clearly satisfy the above equation 
Lly] = Ay, and this is what was to be shown.” 

LemMA 3.2. In order that a zero type set P satisfy a finite order equation of 
form (3.13) it ts necessary and sufficient that in the canonical form (3.19) (into 
which (3.13) can be cast) the following two conditions hold: 

a, = 0, n>Tr; 

a Sin(z) + +++ + a Sn(x) = 0, n>r. 


If r is the smallest positive integer for which this is true, the equation is of order r. 


(3.25) 


For, if (3.13) is of order r, then from (3.20), a, = 0, > r; and from (3.24) 
the other relation of (3.25) follows. Conversely, suppose (3.25) holds. Then 
(3.24) yields the relations L,(x) = 0, > r. The assertion as to the order is 
obvious. 

The function K(t) can be expanded in a power series in J(t), where J is the 
operator for set P: 


'’ However, it cannot be asserted here (as was the case in Theorem 2.4) that if Q is any 
set satisfying L[Q,] = \.Q, , then there exist non-zero constants c, such that Py = ¢nQn 
isof type zero. For, it may now happen that twe@ or more \’s are equal. Suppose Amn = An, 
which value we call \’. Qn, and Q, are solutions of L[y] = \’y, and therefore so is aQn + bQ, 
for all constants a and b. The argument used in Theorem 2.4 (or rather first used in Corol- 
lary 2.2) is thus no longer valid. And it cannot be successfully amended. 
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(3.26) K(t) = nJ(@) + reJ*() + ---. 
Similarly, 
K*(t) = red" + ried" O + + (i = 1, 2, ---), 


F its (x rat’) . 


n=1t n=l 


(3.27) 


Recalling that J[P,] = Pn, we find that 


( K[P,] = nPra t+ ret oe, ee 
)K *(P.] = re Py 2 + 73 Pas + - 


| K' [P| a raP a + rn Pri + 2a s 
And on using these relations and also (3.21) in (3.23), we obtain the further 


relations 


[ nP, -2 Sij{riP n—1 + r441Pria + coe Tin Po} (n sated l, 2, +++); 


| 
(3.29) <¢ 


n(n —1)---(n—-K+1)P, 
= > Sij{rizPnj + +++ + tin Po} (n= k,k +1, --+>). 


j=k 


Equations (3.29) enable us to determine the S,,;(x)’s in terms of the set P. 
Let us define 6(t) by 


(3.30) O(t) ~ rit + ref + --- 
Comparison with (3.26) siows that 
(3.31) O(J(t)) > Kb). 


Suppose the first relation of (3.29) is multiplied by ¢” and the result summed 
froomn = lton = «. The left side becomes > nP,t", while the right side 


is seen to reduce to 


(x Pat! {Su D rit + Sa Do rnt” + seed, 


0 


From the definition of r;;, the series in the braces represent O(¢), O° (t), «+ 
That is, 


} nP,t’ = (= Pat’) 1S,,0 oa Sp. “+ eee . 
n=! 0 


Now p > P,t" is given by (2.7). We thus have 
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(3.32) 5 {A(te™"} = Ae” {Su(z)O(t) + S(x)O*() + ---}. 
In similar manner we obtain from the second relation of (3.29), 


(3.33) t ae {Ae**} = Ae" { Soo” + S2;0° + «+. r 


and from the general relation of (3.29), 


ak 
0 z z Y c Y . 
ose) . at* {Ae*"} = Ae "{Si.0° + Se,2410° + ---}. 
These relations permit us to establish 
THEOREM 3.6. Let P be of type zero, with operator J and determining func- 
tion A. In order that P satisfy a finite order equation of form (3.13) it is necessary 
and sufficient that constants a, --- , a, exist, not all zero, such that the function 


—zH(t) 0 r 
. , te] zx 0 tH 
3.35) t,x) =‘ s= td) + «.. pt — {Ae*™} |, 
( ) Q(t, x) io | at ‘°° Ba +a agr 16 j 
when expressed as a power series in O(t), reduces to a polynomial in O(t). 

Q can be written as a power series in ¢, and can therefore (formally) be ex- 
pressed as a series of powers of O(t). More precisely, from (3.32) to (3.34) 
we have 

Q(t, z) = {a Sujo(t) + --- + {aS + --- + a, S,-}O'(O 
(3.36) . hn 
+ DO faiSin + +++ + ar SnjO"(. 


n=r+l1 

Suppose P satisfies a finite order equation, so that conditions (3.25) hold for 
some r. Then Q(t, x), as seen by (3.36), reduces to a polynomial in O(t). The 
necessity of Theorem 3.6 is thus proved. Conversely, suppose that for some r 
Q(t, z) is a polynomial in @(t). We wish to show that set P, corresponding to 
the A and J in terms of which Q is defined, satisfies a finite order equation. 
We know that P satisfies (2.17). Using operator 7’ of (2.17), we form the 
equation 


(3.37) Liy] = DX ox Trly] = dy, 
k=1 


which is also satisfied by P. It is this equation that we shall prove is of finite 
order. 

If Lly] is recast in terms of*K[y], so that it is of form (3.13), the coefficients 
L,(z) are given by (3.24), with a, = 0 forn > r. Now from (3.36), since Q 
isa polynomial in @(¢), there is an integer s such that 


Sin + eee + Arsen = 0, n> 8. 


Hence L,,(z) = Oforn > s. This establishes the sufficiency. 
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The condition of Theorem 3.6 is more serviceable than is that of Lemma 3.2, 
since the function Q is determined directly from A and H. We can rid our- 
selves of the function 6(¢t) on making use of (3.31). It gives us 

Coro.uaRy 3.3. The zero type set P satisfies a finite order equation if and 
only if constants a, , --- , a, (not all zero) exist so that Q(J(t), x) is a polynomial 
in the function K(t). 

Whenever the choice r = 1 is permissible, the condition of Theorem 3.6 (or 
of Corollary 3.3) is seen to reduce to the conditions (3.2), (3.3) already met. 


Coro.iuary 3.4. If P satisfies an r-th order equation (3.13), then for this 
equation Q is given by 


(3.38) QUt, x) = L,(x)0°(). 


This follows from (3.36). 


Corollary 3.4 enables us to show that neither the Legendre set {X,(x)} nor 
any set {c,X,} is of type zero. The Legendre polynomials are given by 


oO 


(QQ — 2% +f)%*= Dd X,(a)t". 
0 
If {X,} is of type zero, then the left member is of the form exp {xH(t)}. This 
is readily seen to be impossible. 
Now suppose {P,, = ¢,X,} (cn ¥ 0) is of type zero. X, , and therefore P,, 
satisfies the finite order equation 


(1 — 2°)y” — ry’ = ry 


with A = —n(n + 1) for y = P,. Here the operator K[y] is merely y’(z), 
so that O(t) = H(t). Also, L; = —2z, Ll, = 1-— 2°. Hence, from the corollary, 


Q(t, 2) = L,H + LoH® = —2cH + (1 — 2°)H’. 


If we equate coefficients of like powers of x on both sides, we get from the r 
terms: (H” = H’, so that H = ct; and on using this result in the equation 
obtained from the « terms, we find that A(t) = constant. Finally, the constant 
terms tell us that H = 0 so thate = 0. Hence, >> P,t" has for sum a constant. 
This contradiction shows that |P, = ¢,X,} is not of type zero. 


4. Zero type sets that are Tchebycheff sets. The Hermite polynomials are 
Appell polynomials, and are thus of zero type. They are also Tchebycheff 
orthogonal polynomials.” Another orthogonal set of zero type is the Laguerre 


18 The definition of H,(z) in Example 3 of §2 requires modification in order to satisfy the 
condition H'(z) = H,_\(2) for an Appell set. But such alteration consists only in mul- 
tiplying each H, by a suitable non-zero constant c, . This being done, it is known that the 
Hermite set is essentially the only Tchebycheff set that is alsc an Appell set. 
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set (cf. (2.10)). This suggests the problem of determining all zero type sets 
that are orthogonal. 

J. Meixner” has treated this problem by the use of the Laplace transforma- 
tion, taking (essentially) the relation (2.7) as the definition of the polynomials 
under consideration. It is possible to give a quite different treatment by means 
of the known properties of zero type sets, and this we do here. 

As a characterization of an orthogonal set {Q,} we take the relation” 


(4.1) Qn(z) = (@ + An)Qna(Z) + wrQro(z) (n= 1,2,---), 


hn» Hn being real constants with u, # 0,n > 1. If {Q,} is an orthogonal set, 
so is {¢nQn}, Cn # O (although the multipliers c, can spoil normality if Q, has 
this latter property). We shall therefore set the problem as follows: For what 
sets {Q,} satisfying (4.1) do there exist non-zero constants c, such that 


(4.2) P,(z) = c,Q,(2) 
is a set of type” zero? 

Suppose that {P,,} of (4.2) is of type zero. From (4.1) we obtain an expression 
for nP,(x). Comparing this with the value of nP,(x) as given by (2.20), we 
obtain (on equating coefficients of like powers of x): 


(4.3) mien = Coir, Qin = Grogan + (mn — 1)ga, 
(4.4) GitMn41 = n{ qoogir — 109192 — (q1 = qurqa1) (n — 1)}. 


That is, A, is at most linear and yz, at most quadratic, in n, and yu, has a factor 
(n — 1). 

19 Orthogonale Polynomsysteme mit einer besonderern Gestalt der erzeugenden Funktion, 
Journal of London Math. Soc., vol. 9 (1934), pp. 6-13. 

*” As justification we observe first that every set orthogonal according to the classical 
definition satisfies a relation of form (4.1); and secondly, that Shohat has shown that a 
necessary and sufficient condition that a set {Q,} (normalized so that the x" term has a 
coefficient unity) be orthogonal with respect to a weight function ¥(z) of bounded variation 
in(—«, + «) is that {Q,} satisfy (4.1) with », = 0 foralln> 1. (J. Shohat, Comptes 
Rendus, vol. 207(1938), pp. 556-558.) 

We note that in the Shohat definition of orthogonality it is tacitly assumed that no 
member of an orthogonal set is orthogonal to itself (relative to the given weight function y). 
It is easy to show that if an ‘‘orthogonal’’ set satisfies (4.1) with », = 0 for some n > 1, 
then at least one polynomial of the set is self-orthogonal. Thus, for example, the set 
|z"} is “‘orthogonal’”’ for the following choice of y: 


jl, z = 0; 


V2) = 10 2 #0. 


This set also satisfies (4.1) with A, = wu. = Ofor all n. This apparent contradiction to the 
theorem of Shohat is resolved when we note that x” is orthogonal to itself for every n > 0. 
A colleague, H. L. Krall, has made the further observation that every set P for which 
P,(0) = 0 (n > 0) is “‘orthogonal’’ relative to the same function y above. 
*t We shall also assume for convenience that Po(x) = 1. This is no essential restriction 
since it means only that all the polynomials P,(x) are multiplied by one and the same non- 
zero constant. The property of being of zero type is thus unaltered. 
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This condition is also sufficient. For suppose A, , uw». have this form, and 
let {Q,} be defined by (4.1). Let a ¥ 0 be any number and set 
(4.5) Cr = a”/n! (n = 0,1,---). 
Now define {P,} by (4.2). We are to show that P is of type zero. From (4.1) 


and (4.2) follows a relation of form 
(4.6) nP,, = (ax + B + ny)Po-a + (8 + n©)Pr-2, 


a #0,6+ne#O(n> 1). From this we obtain zP,_, as a linear combination 
of P,, Pat, Paz. It is now a straightforward matter to show that con- 
stants quo, Ya exist so that 


Tn = (Qo + 2Qu)Pn—1 + (20 + 2G21)Pr—-2 + -:> 
is identically equal to nP,. They are, in fact, determined by the relations 


(4.7) Gk+2.1 = VOr+ia + qe, 


] 
(4.8) desio = P fqis(d — (Kk — Le) + Qesaal(B — vk) + (K+ l)qese.}.- 


Thus (2.20) holds, and P is of type zero. That is, we can state 

THEOREM 4.1. <A necessary and sufficient condition that an orthogonal set {Q,}' 
given by (4.1), be such that P, = c,Q, is of type zero for some choice of c, # 0 
is that Xr», un have the form 
(4.9) An = a + bn, lin = (n — 1)(c + dn), 
withe + dn # 0 forn > 1. 


As it stands this criterion does not reveal the sets P that are both orthogonal 
and of type zero. We therefore examine the problem more closely. Relation 
(4.8) shows that {qo} is determined when {qu} is known. Let us then turn to 
the recurrence relation (4.7). The characteristic equation is 


(4.10) ue — yu —¢« = 0. 
Case I. y + 4€ = 0. Using the initial conditions qu 
obtain 
(4.11) qu = ak(hy)*", — quearo = (47)" "(4 (By + 26)k + 47(8 + 7)}. 
Then from (2.12) and (2.13), we get” 


2, The presence of the parameter » removes the earlier condition that Po(z) = 1. It 
should also be noticed that we must have y ¥ 0 in (4.18). The case y = 0 is special. For 
if y = 0, then e = 0, and 

qu = a, qe = 0 (k > 1); qi = B, qu = 4, qe = 0 (k > 2). 
Hence 
H(t) = al, J(t) = t/a, A(t) = w-exp {Bt + 4607}, 


where 6 = Oin order that the condition yp, # 0 be fulfilled. 
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a 2at oe 
H(t) = cs J(t) = 


“J 4(By + 26) 
) exp {rt 


—2,2 Bie 
w = — (y — 26) ¥ a non-negative integer 
a) 


(4.12) 


where 


(in order that uw, # 0). 
Case Il. y + 4¢€ #0. Let uy, uw be the roots of (4.10). Then 


(4.13) qu = aly’ + 46) ath — us}, esto = (y’ + 4e) hut — ous}, 


where A, o are constants whose values are readily obtained. Consequently 


(4.14) H(t) = [ dt 
al "bh inwanae 


Case Il,. ¢€ = 0 (sothaty # 0). Then 


| H(t) = a. log (1 = yt) J (t) = ( —_ exp |). 
| 7 Y \ 


a) 


A(t) = w(l — vt)*-exp { —&t, 


¥ 
where 


1 ° 
3 (Or + ¥' + 8), 


and where 6 ~ 0 in order that u, + 0. 
Case Ilz. ¢€ # 0. Let r; = 1/u;. Then 


, et et ma es he 
fri(t — re)\ I) = | 1() = (1 — wet) 


— m , 
ue"? — Us (1 — ut)” 


l 
(4.16) H@) = - log 
Oe ee a 


where 


ee u(B + y) + (6+ 2e) 


uy 


(4.17) p= : (y° + 46)’, hi = (y' + 4 
Qa 


It is to be noted that in all cages H and J do not involve the parameters 
8, 6, 4 and A does not involve a. It follows that all sets satisfying a relation of 
form (4.6) and having the same a, y, € correspond to the same operator J; and all 
sels satisfying (4.6) with the same B, y, 5, € have the same determining” function A. 

The many relations obtained for H, J, A involve the original parameters 
a,--.,e€ and yw, sometimes in complicated manner. This suggests the possi- 


* At least to within a constant multiplier (because of the presence of x). 








614 I. M. SHEFFER 


bility of simplifying by introducing independent combinations of the original 

parameters as new parameters. In fact, the following relations summarize the 
. fa bl . ‘ : 24 : 

various cases. They are, in order: Cases I, I special,” I], , I: ; that is, 


at a a = _ d \ 
(4.18) H(t) = i — hk’ i? A = p(l — bt) exw {,-*y}, 
where a, b, c, d, u are arbitrary, but abcu ¥ 0. 

(4.19) H = at, J= -, A = wexp {bt + ct’}, 


a, b, c, uw arbitrary, but acu + 0. 


(4.20) H = a log (1 — bt), J =—{1 — e"*}, A = pe"-(1 — bt)* 


a, b, c, d, wu arbitrary, but abey # 0. 


ele tie a fot — &)\ ic ef — 1 i - ‘" ( ” 4} 
(4.21) H = ~ log le@ = bf” J =be ase Bs A=n,nil 1 5) ° 


a, b, c, d; , dz, « arbitrary, but abeu + 0 and” b # c. 

We therefore have 

THEOREM 4.2. Let P be of zero type, with operator J and determining fune- 
tion A, so that (2.7) holds. A necessary and sufficient condition that P be an 
orthogonal set is that J, H, A satisfy one of the conditions (4.18) to (4.21). 


Coro.iary 4.1. According to the case, the function Ae™” of (2.7) assumes 
the form:” 


(4.22) Ae™” = y(l — bt)°-exp ‘4 + I (abcu ~ 0), 
(4.23) Ae*™ = y-exp {t(b + ax) + ct’} (acu ~ 0), 
(4.24) Ae™ = ye*.(1 — bt)**™ (abc ¥ 0), 

. ill t d,+z/a t d2—z/a 
(4.25) Ae” = pill —- = ‘1 — b (abeu # 0, b # ¢). 


The Laguerre set is a particular case of (4.22) and the Hermite set of (4.23). 
If it were permissible to choose c = 0 in (4.23) and (4.24) we would have as 
particular cases, respectively, {z"/n!} and the Newton set. Hence, these two 
sets, while not Tchebycheff sets, are nevertheless limiting sets of Tcheby- 
cheff sets. ’ 

If we form the functions {uA’(u)/A(u)}, {ull’(u)}, evaluated for u = J(0), 
we find in the respective cases that 

24 Case I special refers to an earlier footnote under Case I. 

** Also, the condition 6 + en # 0 (n > 1) is to be translated in terms of the present 
parameters. 

© (4.25) is subject to the restriction mentioned in the preceding footnote. 
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(4.26) fuA } o Mua} +e (uH’} =! (a +0); 
\ A a’ a 
=, fuA’\ _ bt , 2et’ —— 
27) “p= ot a {ul} = t; 


4’) 
(4.28) e f = (1 — e“")(ce“* — bd) + be”, {ull’} = a(e“* — le“; 


fuA’) be(1 — e**) ds (e** — 1)(ce** — b) 
( + f ‘i 
4-29) ) A = (c — b) 2 b | iull”) a(c — b)e* 


Since in (4.26) and (4.27) the expressions are polynomials, it follows from 
Theorem 2.3 that the sets P of the first two cases satisfy the respective finite 
order equations 


4.30) Mly(z)] = 2 (a — 0) + 2} y'(a) + ne + aly") = dy(2), 


(4.31) Mly(zx)] =( + hy (x) + yx ) = dy(z), 


where \ = nfory = P,. The sets for the last two cases clearly do not satisfy 
a finite order equation of form (2.14). 


5. Sets of higher type. Although the present paper has as its main purpose 
the treatment of zero type sets, we propose in this section to indicate some 
extensions to sets of higher type. The definition of higher type depends on 
what characterization of zero type sets one wishes to generalize. We have 
given one definition in §1. This we shall call A-type. Thus: A set P is of 
A-type k if in (1.12) the maximum degree of the coefficients L,(x) is k. If the 
L,’s are of unbounded degree, P is of infinite A-type. 

Let P be an arbitrary set. There exists a unique sequence of formal power 
series {M,,(t)} of form 


(5.1) M,(t) = Mnpl” + ma + we Siac (Man = 0) 


such that” 
(5.2) e* =~ DY P,(z)M,(). 
n=0 


DreFINITION. We shall term the set of series (or functions) M: {M,(t)} the 
E-associate of set P. 

CoroLuary 5.1. Let P be a set and M its E-associate. A necessary and 
sufficient condition that P be of type zero is that formal series 

*7 For our purpose it is a matter of indifference whether or not the series symbolized by 


M.(t) converge. (5.1) is regarded as a “‘carrier’’ for the coefficients m,,, and (5.2) is merely 
aconcise way of writing infinitely many linear equations in these coefficients. 
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(5.3) A(t) = > dnt” (a # 0), J() = > cnt" (ce, ¥ 0) 

exist so that 

_ (J@r _ fu" \ = 

(5.4) M,(t) = AW) om \aqwf (u = J(2)). 


For, if P is of type zero, (2.7) holds. And from (5.2), on setting t = H(u), 


A(uje™™™ = ¥ P,(x)M,(H(u))A(u), 
so that 


A(u)M,(H(u)) = u”. 


(5.4) follows on inverting: u = J(t). 

Conversely, suppose (5.3) and (5.4) are satisfied. Define the set Q to be of 
type zero, corresponding to operator J and determining function A, and let M* 
be its E-associate. Then from the half already established, M* has the value 
given by (5.4). That is, M*(t) = M,(t) for all n. But just as M is uniquely 
determined from knowledge of P and (5.2), so is P uniquely defined by (5.2) 
when M is given. Hence, Q and P are identical, and P is of type zero. 

We now characterize sets of A-type k. Let P be such a set. Then, as we 
know, 


(5.5) LP.) = Pr, 

where 

(5.6) Ly(x)] = Joly] + rily) + --- + 2*Jalyl, 

Jo, +++, being linear differential operators with constant coefficients such 
that 

(5.7) Jelt) = age + angel ? +---, OStSk. 


Also, in order to insure that LZ carries every polynomial into another of degree 
one less (since L[P,] = P,»-1), we have the further condition 
En = Qn + nay + n(n — l)ag +--- + n--- (n—k + lag ry 40 


(5.8) 
(n = 0,1, ---). 
From (5.6) and (5.2) we have 


Lie] = {Jo(t) + Jit) + --- + 2 Jil) fe” 
(a) Ea ed 
~ DY LP.IM,(t) & DD Pa(x)Marll)- 











()). 


(u), 
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M* 
ilue 
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5.2) 
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But also, on differentiating (5.2) k times in ¢t, we get 
(b) {Jot +++ +a dase = DO Paz) {JoMn + JiMi + «+» + Ji MQ}. 
0 


Again, in (a) and (b) both the coefficients of P,(x) are power series beginning 
with a term in t"”’ (ef. relation (5.8)). These coefficients must therefore be 
identical. That is, 


(5.9) Mnsi(t) = Jo(t)M.(t) + JiQMi) + --- + AeOMOP) (n = 0,1, ---). 


This proves the necessary part of 

THEOREM 5.1. Let P be a set and M its E-associate. A necessary and sufficient 
condition that P be of finite A-type is that for some k there exist formal power series 
(5.7) satisfying condition (5.8) such that M satisfies (5.9). Moreover, if J,(t) 4 0, 
P is of A-type k. 

The sufficiency is established as follows: (5.7) and (5.8) determine an operator 
L of form (5.6). The first part of (a) holds to give 


(c) Lie] = DO LPilMn = DO LP asilMas, 
0 0 
while (b) continues to hold. On using (5.9), (b) reduces to 
(d) Lie] = 2 PaMaun, 
0 
so that 
(e) DX {LlPriil — Pa} Mau) = 0. 
0 


This formal identity means that if we rearrange in a power series in ¢, all coeffi- 
cients must vanish. Recalling the form (5.1) of the functions M, , we see that 
all the braces likewise vanish. That is, L[P,] = Pr,n 21. That L[Po] = 0 
isimmediate. Hence P is of A-type S$ k. The statement in the theorem con- 
cerning the precise type number is evident. 

If k = 0, (5.9) reduces to M,4, = JM, (dropping the subscript 0); i.e., 
M,(t) = M)(t)[J(é]". This coincides with the previously found condition (5.4) 
if A(t) is chosen so that M,(t)A(J(t)) = 1. 

It has already been observed that if M is any sequence of power series of form 
(5.1) (which includes the condition m,,, # 0), then (5.2) uniquely defines a set P 
for which M is the E-associate. To say that M is the E-associate of some set P 
is then equivalent to saying that M is of form (5.1). 

From Theorem 5.1 we can prove 

THEOREM 5.2. Let M be of form (5.1), so that M is the E-associate of a set P. 
A necessary and sufficient condition that P be of finite A-type is that k exist so that 
the following (k + 1) ratios of determinants are independent of n: 
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M. --- Mo" Ma, MY... W@ 


Ast) =| «++ eee tee eee coe tee 
= Mase 26 M on Mnsnsi Mise eee M, 
hata M. Ma «-- MO 
+> eee eee ef (j = 0,1, ---, k). 


Mare Mase -> Ms 


And if A,(t) # 0, the A-type is precisely k. 

First, suppose P is of A-type k. If in (5.9) we replace n by n, n + 1, ---, 
n + k respectively, we obtain k + 1 equations for Jo, --- , J, whose solution is 
given by the determinant ratios” in (5.10); i.e., J ;(t) = 4,(t). Thus the condi- 
tion is necessary. Conversely, suppose (5.10) holds, the A,’s being independent 
of n. Then (5.9) is satisfied by the choice J; = A;. The sufficiency will now 
follow from Theorem 5.1 if we show that J; satisfies conditions (5.7) and (5.8). 

Suppose the series (5.1) is substituted into (5.10). It is found that the 
numerator and denominator have as lowest terms, respectively, 


n+k 
anf" FF Mi , 6, t™. [] Mi, 

where 

| n eee n(n — 1) ---(n—k+1) 

ae (n + 1) eee (n+ 1).- alii athe 

| (n+ k) eee (n +k). - (n+ 1) 
and where a;, is obtained by replacing the (j + 1)-th column of 8, by the ele- 
ments Misn isn + Misn-1ien—1 (@ = 1,2, ---,& +1). Hence J; cannot begin 


with a term of degree less than (*' if we show that 6, ~ 0. If in 8, we subtraet 
each row from the one following, we obtain a new determinant (of value 8,), 
which is k! times 8%, where 8.. is obtained from 8, by changing k to k — 1 
Working down to k = 1 we get 8B, = k\(k — 1)! --- 2! 11, and this is not equal 


to zero. 


Thus, condition (5.7) holds. There remains to establish (5.8). Since A; = J; 
begins with the term t’"’a;, + 8, (or a term of even higher degree), we see on 
comparing with (5.7) that a;,j41 = ajn + Bn. Hence aj,j41 (j = 0, 1, --- , &) 


is the solution of a system of (k + 1) linear non-homogeneous equations, the 
matrix of whose coefficients is given by the elements of the determinant §,, 
and whose non-homogeneous terms are the quantities Miyn.i¢n + Mitn—1,i4n—1- 
The first of these equations is 

Mnr+in+1 


oy + Ny + ++» + n(n — 1) -+- (Nn —k + Ddeaesr = 


Mnrn 


2* That the denominator does not vanish identically is demonstrated later in the proof, 


where it is shown that #8, # 0. 
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The left side is the quantity &, of (5.8). As m, # 0 for all 2, sois &, # 0 for all 
n. Thus (5.8) holds and the proof is complete. 

We come now to a second definition of type. 

Lemma 5.1. To each set P corresponds a unique sequence of polynomials 
(7',(x)}, with T,(x) of degree not exceeding n, such that” 
(5.11) PQ = ToPaa + T1Po-2 +--+ + Tr-1Po (n = 1, 2, ---). 

This is seen if we set n = 1, 2, --- successively. It is to be observed that the 
T,,’s do not determine the P,’s uniquely. In fact, there are infinitely many sets 
satisfying (5.11) for a given sequence {7',}. 

DeFINiTION. A set P is of B-type k if in (5.11) the maximum degree of the 
polynomials T,(x) isk. Otherwise, P is of infinite B-type. 

If P is of B-type zero, the 7’,’s are constants, so that (5.11) reduces to (2.21). 


That is, P is of A-type zero. The converse is also true: 


Coro.uarRy 5.2. A set P is of B-type zero if and only if it is of A-type zero. 


Let 
(5.12) H(a,t) = > P,(x)t", 
0 
(5.13) T(z, t) = > T,(x)t”. 


Then (5.11) is seen to be equivalent to the relation 


HT = Of 
Ox 
which, when solved for H, gives us 
ni 
(5.14) H(z, t) = A(t) exp it] T(z, 0) dr, 


where it is to be understood here and later that A(é) is an arbitrary power series 
beginning with a (non-zero) constant term. Conversely, if 7’ is any series (5.13) 
where 7’, is a polynomial of degree not exeeeding n, then H(z, t) as defined by 
(5.14) is such that (5.11) holds.” 

If T(x, t) is written as a power series in x rather than ¢, (5.14) assumes the form 


(5.15) H (a, t) = A(t) exp {xHi(t) + 2°H2(t) + ---}, 

*? This is an extension of Theorem 2.6. 

*® The 7',’s are not completely arbitrary. For P to be a set, it is necessary that P, be 
of degree exactly n. This reflects itself in the non-vanishing for n = 1, 2, --- of certain 
polynomials in too , t11 , foe , «+: , Where ¢;,; is the coefficient of x‘ in T;(x). These conditions 


can be obtained from (5.11) by demanding that the right member be of degree exactly 


(n — 1). 
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where the H; are power series in t, H; beginning with a term in t' or possibly 
higher. (H, definitely begins with a term in 7.) 

For P to be of B-type k, T(z, t) must be a polynomial in z of degree k. Thisis 
necessary and sufficient. On integrating, we get a polynomial of degree k + 1 
inz. Hence we have 

TuHreoreM 5.3. A necessary and sufficient condition that a set P be of B-type k 
is that it be given by (5.12) where H is of the form 


(5.16) H(a, t) = A(t)- exp {xHi(t) + --- + 2°" Heid}, 
the H;(t) being of form™ 
(5.17) H(t) = hit! + hit’ + --- (hu # 0). 


Given a set P, there exists a unique sequence of polynomials {U,(z)}, U, of 
degree not exceeding n, such that 
(5.18) aP,, « UP ua d+: + UP (n = 1, 2, ---), 
(The U,,’s are determined successively if we set n = 1, 2, --- .) 

Derinition. P is of C-type k if the maximum degree of the U,’s is (k + 1). 

If we set 


(5.19) U(x, t) = Do Ual(zt", 
0 
then from (5.12) and (5.18), 
HU = oH 

at 

so that 
‘ \ 

(5.20) H(z, t) = c-exp if U (a, t) *}- 

0 


Here ¢ is an arbitrary (non-zero) constant. Comparing (5.14) and (5.20), we see 
that 


t z 
(5.21) log c + / U(x, t) dt = log A(t) + | T(z, t) dz, 
0 0 


so that 
( U i= A’ [ LJ g d: 
| (z, t) A + ; aj tT) a2, 
(5.22) | ‘ary 
) 
tT (x, t [ —— dt. 
( , 0 0 Ox ; 
Coro.iary 5.3. P is of C-type k if and only if it is of B-type k. 


31 The non-vanishing conditions on the t;; referred to in the preceding footnote become 
non-vanishing conditions on the hy; . 
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For P is of C-type k if and only if the brace in (5.20) is a polynomial in z of 
degree k + 1. (5.20) thus reduces to (5.16), including the conditions (5.17). 

There is no such close link between A-type and B-type. Consider, for 
example, the set 


n 


x 


ni(n + 1)! 
It is of A-type one since L[P,] = Pn, where 
Lly] = 2y’ + zy”. 
(Also, Jo(t) = 2t, J.(t) = @, and M,(t) = (n+ 1)!".) On the other hand, 
a (at)” 
H(z, t) = , 
@) = 2 aD! 
so that log H is decidedly not a polynomial in z. P is therefore of infinite B-type. 
Let P be any set, and L its associated operator (i.e., L[P,] = P,.). From 
(5.18) we have 


P, (x) -_ 


U,L[P,] + slid + U,L"|P,] = nF’. ’ 
so that we get 


Corouuary 5.4. Every set P satisfies an equation of form 
(5.23) Viyl = Do Ue Ly] = dy, 
k=1 


where = nfory = P,. The U,’s are defined as in (5.18), and L is the operator 
associated with P. If P is of B-type k, the coefficients in (5.23) are polynomials 
of maximum degree (k + 1). 

In connection with sets of finite type (according to one definition or another) 
there arises the problem of the application of finite type sets to the solution of 
functional equations. This problem we reserve for another occasion. We shall 
terminate the present section by showing that the Legendre polynomials are 
of infinite type according to all the definitions given. 

The B-type is determined by the maximum degree of the polynomials 7’, (2x) of 
(5.11). Using relations (5.12) to (5.14), where H = (1 — 2tx + t’)*, we find 
that 


(a) T(z, t) = >> T,(x)t" = (1 — Qtr + FY", 
so that 7’, satisfies the recurrence relation 
(b) as — 2xT, -1 + T»~2 = 0, n> 0. 


With the initial conditions 7) = 1, 7; = 22, the solution of (b) is 


(c) T,(x) = 5 {7 +0)" —~ (2-6), 9 = (2? — 1), 
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It is seen that 7',(x) is of degree n, so that {X,(x)} is of infinite B-type (and 


C-type). 
Now consider the A-type of {X,}. If L is the associated operator then, 


(d) L[X,] = Xn, 

where 

(e) Lly] = Lo(x)y’ + Li(z)y” + ---. 
Multiplying (d) by t” and summing from n = 0 ton = ~, we obtain 
(f) L[H} = tH, H = (1 — 2+)". 


Relation (e), for y = H, simplifies to 


_< 1-3... (Qn + 1)". 
‘e) we x Ln(e) (1 — 2t2 + #4’ 


and if we set 


t 
(h) a or 
this becomes 
(i) x {1 + Qrr — (1 + 4d + 4(x? — 1)r”)!} = 501-3 --- (Qn + DA" L, (2). 


Since (g) is an identity in the variables t, z, so is (i) an identity in the variables 
A, xz. If {X,} is of finite A-type, the right member, and therefore the left, isa 
polynomial in z. This is manifestly untrue. Hence {X,} is of infinite A-type. 


PENNSYLVANIA STATE COLLEGE. 
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ANNIHILATOR IDEALS AND REPRESENTATION ITERATION FOR 
ABSTRACT RINGS 


By C. J. Everett, Jr. 


1. Introduction. It has been shown’ that in a ring ly with additive group 
My, every element g acting as a left multiplier on Ty) , defines an operator A in 
the endomorphism ring Q of Mo, such that gf = ATo, glo being the ordinary 
ring multiplication, and AT) the map of Mo by the operator \. The set of all 
such operators \ forms a subring ¥ = X(T) of Qo to which Ty is ring homomorphic. 
We shall call &% the left representation of To. Similarly the right multipliers 
define a right representation RT of lo, where RT consists of a ring inverse iso- 
morphic to the subring ® of operators of Q defined by the right multipliers; 
R? being used in order that Ty be ring homomorphic to #7 in the ordinary sense. 
We have new rings %, #7, each having an additive group with corresponding 
operator rings. We are thus free to iterate the process, forming left or right 
representations of representations in any order. We shall study these repre- 
sentation rings and their isomorphism to residue class rings of ly modulo certain 
annihilating ideals.” 

2. The ideal theory. Let I’ be an abstract ring, and define =” as the set of all 
products 8; --- Sm of m factors, s;¢ 2 CT. Denote by (r, J) the set of all x of T 
such that I’'zI'' = 0 (r, 1 = 0,1, ---). T° is merely deleted wherever it occurs 
formally. Thus (0,0) = 0, and (0,1)F = 0. It is clear that (r, J) is a 2-sided 
ideal in T and (r, 1) C (r + 8,1 + k) for all s, k. If A is a 2-sided ideal in I, 
we write  — A for the residue class ring of T mod A. 

Let \ be the least 1 for which (0,7) = (0,1 + 1), p the least r such that (r, 0) = 
(r+ 1,0). Tis said to be of L-type X, of r-type p. If = 0,T is called l-definite; 
if p = 0, r-definite; and if both, definite. Conditions on I for finiteness of type 
are given in §3. We shall consider only rings with finite p, A. 

TueorEM 1. Jf in '—(r, I), (s, k) = 0, then in T—(r — i, l — 9), (97) = 

(+ s,7 + k), and conversely. 
Suppose I **zt?** = Oinl—(r — i,l — j);ie., Per’ = Oin’ and zI* = 
0inl'—(r, 1). Hence x = 0 in the latter ring, and zl = OinT. Hence in 
l — (r — i, l — j), Mat’ = 0, and in this ring (¢ + s, 7 + k) C (7,7). The argu- 
ment is reversible. 

Received February 24, 1939. 

‘Rings as groups with operators, Bull. Amer. Math. Soc., vol. 45(1939), pp. 274-279. 

*The extension of the theory to rings with rings of operators is readily made. Thus 
in the case of linear associative algebras 2 and 7 are the usual left and right regular 
matrix representations. 
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Coro.tiary 1. T—(0, l) is l-definite if and only if (0,1) = (0,1 +1). T'—(r,0) 
is r-definite if and only if (r,0) = (r + 1,0). 

Coro.uary 2. I'—(0, A) is Ldefinite, T—(p, 0) is r-definite, and p, d are the 
least such integers. 

THEOREM 2. (r,l) = (r,l + 1) implies (r + s,l) = (r +8,1+ 1) and (r, I) = 
(r + 1,1) implies (r, 1 +k) = (r+1,1+ 4). 

This is immediate from definition of (r, 1). 


Corouuary 3. T — (r, l) is definite if and only if (r, 1) = (r + 1,14 1). 


The definiteness with Theorem 1 yields (r, 1) = (r,1 + 1) = (r + 1, 1), whence 
(r, l) = (r + 1,1+ 1) by Theorem 2. The converse follows from Theorem |! 
after noticing that (r, 1) = (r + 1,1 + 1) implies (r, 1) = (r,1+ 1) = (¥ + 1,). 


Coro.uary 4. I'—(p, A) is definite. 

For (0, 4) = (0, A + 1) and (p, 0) = (p + 1, 0) imply (p, A) = (p, A + 1) = 
(o + 1,4 + 1), by Theorem 2. 

THEOREM 3. If A is a 2-sided ideal in T, then T — A l-definite implies A D 
(0, 1); T — A r-definite implies A > (r, 0); and I — A definite implies A > (r, )), 
all r, 1. 

Suppose in — A, (0,1) = (1,0) = 0;i.e., 2! = 0 (A) implies z = 0 (A), and 
l'z = 0 (A) implies z = 0 (A). Then for every r, l, '’zI' = 0 (A) implies z = 
0 (A). Since I’(r, DT’ = 0 A, (r, 1) C A. Similar proofs establish the rest of 
the theorem. 

Coro.tuary 5. (0, A) = (0,4 + J), (p, 0) = (9 + 17, 0), and (s, k) = (s +1, 
k + lL) for all s, k for which T —(s, k) is definite. 

Use A = (0, A), (p, 0), (s, &) consecutively in Theorem 3. 


Coro.iary 6. (0, A), (p, 0), (p, A) are “minimum” ideals for which the cor- 
responding residue class rings are I-definite, r-definite, and definite, respectively; 
i.e., each of these ideals is contained in every ideal having the corresponding property. 


We may now make several remarks concerning the lattice of ideals (r, J) in 
matric array, r, | serving as row and column indices, respectively. If !—(r, } 
is definite, (r, 1) defines a definite point of the lattice. A definite point for which 
there exists no definite point (s, k), s < r, k < l, is a frontier point, the set of all 
such constituting the frontier of the lattice. The concept is significant in the 
iterated representation theory. Roughly, the frontier is the boundary between 
definite and non-definite points. It is to be noted that the first repetition in any 
row cannot occur to the right of the first in the preceding row. A similar remark 
applies to columns. All definite points define equal ideals. 

Since in a commutative ring, A = p, (a, 0) = (0, a) and (a + b, 0) = (a, ), 
it is seen that the frontier here is of step form, consisting of points (a, b) with 
a+b = X, and (ce, 9), (0, d), c,d 2 ». A nilpotent ring of index a ([ = 0) 
has a similar frontier witha + b=a-—1l1,c,d2a—1. 
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3. Iterated representation theory. Consider the sequence of homomorphic 
representation rings 


(1) Tow Ti ~w~--s ~ Ta ~ Tag ~~ +++ ~ Tapp ~~: 


obtained by taking successively left representations of left representations of Ip 
as indicated in §1. We shall need a notation L(2,: 2) for the set of all elements g 
of a ring I such that g=, C 22, where 2, , LY: are subsets of f. If 2, has the 
property that '2,C 2%, , and if 22 is a left ideal in’, L(21: 22) is a 2-sided ideal in 
r. Thus L(f":0) = (0, n) in the previous notation, and L(T,,:0) is the set of all 
elements of I’, corresponding to 0 of I’; in (1); i.e., a ring I’, is /-definite in case 
L(T,:0) = 0, whence T, & I'n41. For convenience we define L (r3:0) = 0 
Note that for 2; , 22 as above, L(2j': Ze) is a 2-sided ideal in [ and aoe min 723) = 
L(27:L(2?:22)) since each set is contained in the other. We prove the 


LemMA. In the correspondence T, ~ T'ni1 of (1), the set in T,, corresponding to 
L(0243:0) in Tags is L(T2**:0). 

For p = 0 the lemma is trivial under the convention L(I‘)4::0) = 0. Let 
p=l. Ift—> 2% ¢€ L(V n41:0), then &,—- Try, = 0. Hence &¢ L(T,:L(T,:0)) 
= L(?:0). Conversely, if ¢ « L(1%,:0) and ¢ > #’, 7, > €Tray. But &,C 
L(T,:0) — 0, and @’Ty4, = 0, whence &’ ¢€ L(T,4::0). Assuming the lemma for 
p =k, we prove it forp=k+1. If &— & € L(V23}:0) = Lagi: L(0241:0)), 
then £1, — &Trag. C L(0241:0). Hence by hypothesis ¢f, C L(T2**:0) and 
te L(T,:L(P2"':0)) = L(r2**:0). Conversely, if § ¢ L(T2"":0) and ¢ — ¢’, then 
Tf, > €Tai1, where f, C L(T?*':0) > L(P24::0). Hence we see that 
fe L(Payi2L(0241:0)) = L(12t1:0). 

We have immediately 

THEorEM 4. In the homomorphism T, ~ T,4n induced by (1), L(T?*":0) > 

L(T?4n20) andl, — L(T?:0) STr4n. In particular, T) — L(Ts:0) ST. 


CorotitarRy 7. Jf (0,1) = (0,1 + 1) inTo, 2e.,T — (0, 1) L-definite, then (0, 7) 
= (0,7 + 1) inTy) — (0,1 — 7) =P; for all j; and conversely, if (0,7 ) = (0, 
j+1) inf — (0,1 — j) for any j,T — (0, l) is l-definite. 

Leti = s = r = 0,k = 1 in Theorem 1. 

THEOREM 5. A necessary and sufficient condition that Ty — (0, n) be l-definite 
is that for some l, k 

L(Vi:0) = L(ri*':0); 
ie, in Ty — (0, k) =T;, (0,2) = (0,2 + 1). 


TuHEorEM 6. A sufficient condition that — (0, n) be l-definite is that for some k, 
ly — (0, k) > Ty have the maximal chain condition for 2-sided ideals. 


This is a weaker condition than the possession of the chain condition by Ty 
itself. If !y ~ T, are any homomorphic rings, a chain condition on Ty implies 
one on I; , but not conversely, even for the rings here concerned. For example, 
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consider the null algebra Ty ([$ = 0) with infinite basis (wm , ue, --+). Here 
lr, = 0 has a chain condition, but not To, e.g., (wu) C (ute) C --- is non- 
repeating.” 

Rings with maximal chain condition for ideals are thus of finite L-type A. In 
particular, if [ is a linear associative algebra, the theory applies with obvious 
modifications, \ being S a, where a is the index of the algebra. As noted in 
§2, for a nilpotent algebra, \ = p = a — 1, and (p, 0) = (0, A) = 

A strictly analogous theory may be deduced for right representations of right 
representations of I. We shall now study the case of mixed iterates, left and 
right representations being arbitrarily derived from preceding ones. 

Denote by &(T) and R7(T) the left and right representations of a ring T. We 
may consider the iterated homomorphisms: 


(2) r~ LP) ~ R7(L(L)); r~ R(T) ~ L(R7(L)). 


While R7(L(T)) and L(M7(T)) are conceptually distinct, R(L(T)), L(M7(T)) 
being subrings of the endomorphism rings of the additive groups of &(T) and 
R(T), respectively, they have the property given by 


THeoreM 7. L(R™(T)) S R7(L(T)) =T — C1, 1). 
This is a special case of the more general 


THEOREM 8. The ring resulting from the iteration of l left and r right representa- 


tions in any mixed order is independent, up to isomorphism, of the order in which 
the iteration is performed, the resulting ring being always isomorphic with T — (r, l). 


For inl ~ Ys ~My wees SLEW RL) ~ --- ~ RF (L), where &; denotes 
L(Li1) and Ri (L,) denotes R7(R7_y(L,)), the set in L, corresponding to 0 in 

(2,) is R(Vi:0). Now inT, let €— & € R(Mi:0) in&,. Then I"t > Ve’ = 0. 
Hence £ e R(I":L(1':0)) = (r, J). Conversely, we may show that every element 
of (r, 1) +0 in R7(L,). The same result may be obtained for T ~ 2,(R7). Hence 
r = | = 1 gives Theorem 7, which in turn establishes the independence of order 
for Theorem 8. The significance of the frontier concept is now clear, it being 
necessary to take as many left and right representations (in any order) as will 
yield an ideal on or past the frontier, in order to obtain a definite representation, 


4. Rings with units. Such rings give rise to special theorems: 


TueoreM 9. If e, is a left unit for T, then (1,0) = 0 inT, e, + (0, 1) is the set 
of all left units of T, and e; is unique if and only if T is L-definite. T — (0, A) # 
definite and has a principal unit. 

For 'x = O implies z = Osincee, eT. (e, + (0, 1)) ! =I so every element of 
e: + (0, 1) is a left unit of ©. Moreover, if ¢; is also a left unit of I’, then (e; - 
e)! =f —T = Oand e; = e, modulo (0, 1). Since for’, p = 0,1 — (0, A) is 


>] am indebted to George Whaples for this suggestion. 
* In this case we use the notation R(2,:2_,) for all g e [ such that Sig Cy. 
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definite (Corollary 4). Hence the class [e;] is a unique left unit for this ring. 
It is thus a principal unit.” 

THEOREM 10. Jf IT is r-definite, the frontier of T consists of (0, X + 7), and 
(r,A), allt, r. The frontier of a definite ring consists of (0, 1), (r, 0), alll, r. 


Rings with left unit, and rings with unit afford examples of these two types. 


5. Representation of the center. We shall now consider the relation between 
the center Z of I and the intersection & NM MR of 2 and KR, these being subrings of 
the common over-ring 2, the endomorphism ring of the additive group of I’, as 
indicated in §1. Recall that [! — § (inverse homomorphic: 7; — pi , 72 > pe 
implies 7172 — p29). 

THEOREM 11. & 1 & is in the center of 2 and of R and contains the map of Z 
under both’ ~ LandT ~ ®. 

Since I(r) = (IL)r, it is clear that YR = RY element-wise. Hence every 
element of £ M 9 is commutative with all the elements of and of R. Moreover, 
ifzisin Z, 2!’ = l'z and z defines the same operator whether acting as left or right 
multiplier, i.e., z maps into the same operator in both correspondences of the 
theorem, and its map is thus in 2M R. 

THEOREM 12. Jf IT is definite, the correspondences of Theorem 11 are one-one, 
£1 R is the center of L and of R, and is the (left, right) representation of Z. 

Let z < £ in the center of %. Because of the isomorphism, & = Yé implies 
a2’ =Tz,andzisinZ. By Theorem 11, &¢8. Hence the center of Lis 2M R. 
Similarly for #.° 

Coro.tiary 8. Ina definite ringl, 2! = ly elementwise implies x = y. 

zl = l'y means that x and y correspond to the same operator @ which is thus in 
Land in R, thusin 2 R. zis therefore in Z, 2! = 2,0x = ly, T(y — x) = 0, 
and y = x since I is definite. 

An elementary modul’ may be defined, for the purposes of this note, as one for 
which there exists a ring [ with unit element having the modul as additive group, 
and such that every endomorphism of the additive group is defined by a left 
multiplier. 

Coro.tuary 9. The endomorphism ring of an elementary modul is commutative. 


Let x eI’; since z as right multiplier on I’ defines an endomorphism of the 
additive group, there exists a y such that yf! =Tx. By Corollary 8, z is in the 
center of [. But x was arbitrary. 

UNIVERSITY OF WISCONSIN. 

‘If e is a unique left unit of a ring R, e is a right unit. For (e + re — r)R= 
R+rR—rR = Rimpliese + re —r=eforallr eR. This proof isdue to W. L. Mitchell. 

* This is in agreement with the well-known fact that the left regular representation 
of a commutative linear associative algebra with unit element is identical with the trans- 
pose of the right regular representation. 

’ Rings as groups with operators. 











PSEUDO-NORMED LINEAR SPACES AND ABELIAN GROUPS 
By D. H. Hyers 


In his study of linear topological spaces’ J. von Neumann has introduced the 
notion of a pseudo-metric, which can be defined in any locally convex’ linear 
topological space. It is a real-valued function and has many of the properties 
of a norm, including the fulfillment of the triangular inequality. The pecu- 
liarity of this pseudo-metric is that its value depends not only on the element z 
of the space but also on a neighborhood U out of a system of neighborhoods 
of the origin. In the present paper the pseudo-metric is generalized in the 
following way. First, the triangular inequality is replaced by a much weaker 
condition of continuity, and secondly, the neighborhood system is replaced by 
any “strongly” partially ordered set, and this makes it possible to postulate a 
“‘pseudo-norm”’ for a linear space without first postulating a topology. 

In the first section of the paper pseudo-normed linear spaces are defined and 
shown to be the same as linear topological spaces. The pseudo-norm includes 
as special cases both the pseudo-metric of von Neumann and the pseudo-norm 
previously defined by the author in [3]. 

Pseudo-normed Abelian groups are defined in the second section of the paper, 
and it is shown that a topological Abelian group G has a “linear topological 
extension” if and only if the topology of G can be generated by a pseudo-norm. 
Thus pseudo-normed Abelian groups are just those Abelian groups which are 
topological subgroups of a linear topological space. 


1. Let D be a set with elements a, b, c, --- which has the composition prop- 
erty of Moore and Smith. That is, we are postulating that (i) if a > b, then 
b > a; (ii) if a > b and b > ec, then a > ¢; (iii) given a and b there exists ¢ 
such that c 2 aandc = b. The set D together with the relation > will be 
called a strongly partially ordered space.’ 

A linear space L will be said to be pseudo-normed with respect to D if there 
exists a real-valued function n(z, d) defined on LD which satisfies the following 
postulates: 

(1) n(x, d) 2 0; n(x, d) = 0 for all de D implies x = 6, where @ represents 
the zero of L; 

(2) n(ax,d) = a} n(z, d) for all real a; 


Received March 5, 1939; presented to the American Mathematical Society, September 6, 
1938. 
1 See [5]. The numbers in brackets refer to the bibliography at the end of the paper. 
? Local convexity is equivalent to von Neumann’s convexity. See p. 158 of [6]. 
* Such sets were first used in general topology by G. Birkhoff. See [2]. 
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(3) given » > 0, ee D, there exist 6 > 0, de D such that n(x + y, e) < 9 
for n(x, d) < 6 and n(y, d) < 4; 

(4) n(x, d) = n(x, e) whenever d > e. 
We shall call n(x, d) the pseudo-norm of x with respect to d. 

As an example of a pseudo-normed space which is not a normed space, take L 


to be the space of all real sequences x = (2%, 22, 23, --- ) and D to be the set 
of positive integers. Define n(x, d) by the equation n(x, d) = max | 2; |. It 
lsisd 


is clear that all the postulates for a pseudo-normed space are satisfied. Another 
example is the space‘ of all measurable functions x(¢) on the real open interval 
(0, 1). If we take D to be the interval (0, 1) with ordering as usual, we may 
define the pseudo-norm n(z, d) by the equation 


n(x, d) = g.l.b. {lu.b. | 2@ | }, 
E 


m(E)>d_ te 


where £ is a variable measurable subset of (0,1). In the above sequence space 
the pseudo-norm satisfies the triangular inequality, while in the space of measur- 
able functions the pseudo-norm does not satisfy the triangular inequality, but 
only the weaker condition (3). In either case the topology given by the pseudo- 
norm, where z is a limit point of a set S if given 6 > 0, de D there exists a 
point ye S, y # x such that n(y — 2, d) < 4, is equivalent to the usual metric 
topology for the space.” 

By a linear topological space (I.t. space) we mean a linear Hausdorff space 
in which the fundamental operations z + y and ez are continuous. It is known’ 
that the following definition, essentially that of von Neumann, is equivalent to 
the above definition. 

A linear space L will be called a 1.t. space if there exists a fundamental system 
\l of subsets U of L such that the following postulates are satisfied. 

I. The intersection of all the sets of U is @. 

II. Given U ¢ Wand V ¢ U, there is a W e U such that W CU.-YV. 

lil. Given U ¢ Ul, there is a V e Ul such that aV C U for all @ satisfying 
-iS a2 $ 1. 

IV. For each U e Uthereisa Ve Uwith V+V CU. 

V. Given xe L and U ¢ U, there is a real number a@ such that zeal’. 

These postulates are convenient in proving Theorem 1 below. The sets U, 
which are not necessarily open, will be called neighborhoods of @. Any system 
W’ of sets will be called equivalent to U if the Hausdorff equivalence criterion 
holds, that is, if for any U ¢ Ul there is a U’ « UW’ with U’ C U and conversely, 
given V’ ¢ ll’ thereisa V e Uwith V CV’. Any system U” which is equivalent 
to U clearly satisfies the Postulates I-V. 


‘ Actually the elements of the space are equivalence classes of functions, where two 
functions are equivalent if they differ at most on a set of measure zero. 

* See [1], pp. 9, 10, for the definitions of metrics in these two spaces. 

® The equivalence is proved in [4]. 
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Let L be a 1.t. space which is pseudo-normed with respect to a strongly par- 
tially ordered space D. The pseudo-norm n(z, d) will be said to generate the 
topology of L providing the sets’ U(d, a) = [x; n(x, d) < al, de D, a > 0, form 
a system equivalent to the fundamental neighborhood system U. 


THEOREM 1. Every pseudo-normed linear space L is a linear topological space 
in which the pseudo-norm generates the topology of L. Conversely, given any linear 
topological space L, there exists a strongly partially ordered space D with respect 
to which L may be pseudo-normed in such a way that the pseudo-norm generates 
the topology of L. 

Proof. Let L be a pseudo-normed space and consider the subsets U’(d, a) = 
|x; n(x, d) < a]. The intersection of all the U(d, a) is the origin by Postulates 
(1) and (2). Given U(d, a) and U(e, 8), let f be an element of D such that 
f > d,f > e, and let y be a positive number less than a and 8. For z e U(f, y) 
we have n(z, f) < y. From Postulate (4) it follows that n(z, d) < a@ and 
n(x,e) < B. Hence U(f, y) C U(d, a)-U(e, 8), so that Postulate II is satisfied. 
The continuity of the sum (Postulate IV) follows from Postulate (3). It follows 
at once from Postulate (2) that Postulates III and V are satisfied. 

Conversely, let L be a |.t. space, and let U be the set of neighborhoods satis- 
fying Postulates I-V. For each U ¢ U and each a > 0 let V(a, U) be the set 
of all elements of the form 8x, where 8 ranges over the closed real interval 
[—a, a] and z varies over U’. Let WW’ be the family of all the V(a, U). It 
follows from Postulate III and the continuity of the function Bz that WW’ is 
equivalent to U. The system Ul’ evidently has the following properties: 
(a) V e€ UW’ implies BV C V for | 8! S 1; (b) if Ve U’ and B # 0, then BV ¢€ UW; 
(c) UW’ satisfies Postulates I-V. As the partially ordered space take the system 
U’ of neighborhoods, where V; 2 Ve means V; C Vz. Clearly Wl’ is strongly 
partially ordered by the relation >. For each re and each V el’ put 
nix, V) = gl.b. a > 0, reaV. Then n(z, V) is obviously non-negative. 
Since U’ has property (a) it follows that 8 > n(z, V) implies x «BV. Hence if 
n(x, V) = O for all V ¢ WW’, then z ¢ V forall V ¢ UW’ so that z = @ by Postulate I. 
This proves that (1) is satisfied. To show that (2) holds first take 8 > 0. Now 


n(8z, V) = g.l.b.a, Br eaV =g.l.b. a, re*V= a(g.. Tt v) = Bn(z, V). 
B B B 
For 8 < 0 a similar argument holds with a change of sign, since V = —V, 
and finally it is clear that n(@, V) = 0. In order to prove that Postulate (3) 
is satisfied let 7 > 0, V; e U’ be given. Take m < 7. Since Wl’ satisfies Postu- 
late IV and since mV; Ul’ there exists V2¢U’ such that Ve + Ve C mVi. 
Now if n(z, V2) < 1 and n(y, V2) < 1 we have re V2, y € V2 and therefore 
n(x + y, Vi) S m < ». To prove that n(z, V) has the monotonic property (4) 
take V; > V2,i.e., Vi C Ve where V; # V2, Viel’ (§ = 1,2). Foralla> 


7 The symbol [z;__] represents the set of z’s having the property indicated after the 


semicolon. 
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n(z, V2) we have reaV; CaV2. Hence n(x, V2) 2 n(x, Vi). It remains to 
prove only that Ul’ is equivalent to the set of neighborhoods U(V, a) = 
(x; n(x, V) < al, a > 0. Given any set U(V, a) take 8 < a. Then for any 
zeBV we have n(z, V) S B < a. Conversely, if V « Ul’, then n(z, V) < 1 
implies that x ¢ V. The proof of the theorem is now complete. 

A pseudo-norm will be called triangular if it satisfies the triangular inequality 


(3a) n(x + y, d) S n(x, d) + ny, d) 


forallzandyinZandalldinD. The inequality (3a) clearly implies Postulate 
(3). The pseudo-metric of von Neumann satisfies Postulates (1), (2), (3a) and 
(4), and hence is a triangular pseudo-norm. 

Since 1.t. spaces and pseudo-normed spaces are the same it is natural to try 
to express properties of |.t. spaces in terms of pseudo-norms. The next theorem 
affords a definition of local convexity in terms of pseudo-norms. 


THEOREM 2. A linear topological space is locally conver if and only if its 
topology can be generated by a triangular pseudo-norm. 


Proof. The necessity was proved by von Neumann on pp. 18, 19 of [5]. 
The sufficiency follows at once since the triangular inequality (3a) implies that 
the set U(d, a) = [x; n(x, d) < al] is convex. 

A pseudo-norm n(z, d) which is independent of the element d of D reduces 
to the “‘pseudo-norm”’ defined in [3]. Hence in view of Theorem 3 of [3] we see 
that a 1.t. space contains a bounded open set if and only if its topology is gen- 
erated by a pseudo-norm independent of d. Finally it is obvious that a pseudo- 
norm is a norm in case it is both triangular and independent of d. 


2. By a topological group we shall mean a group which is a Hausdorff space 
such that the group operation and the inverse operation are continuous. A 
topological Abelian group G (written additively) will be called a topological 
subgroup of a linear topological space L if G is a subgroup of L and if the criginal 
topology of G is equivalent to the topology for G obtained from L by relativiza- 
tion.” A lt. space L which contains a topological subgroup G’ which is con- 
tinuously isomorphic’ to a topological group G will be called a linear topological 
extension of G. Not every topological Abelian group containing no elements of 
finite period has a linear topological extension. For example, the additive 
group of all complex numbers topologized by means of the metric p(x, y) where 
p(z, y) = 1 for  ¥ y and p(z, x) = 0 for all z is a topological group which 
has no I.t. extension. 

Let G be an Abelian group, written additively, and let D be a strongly par- 
tially ordered space. The group G will be said to be pseudo-normed with respect 


* Le., the topology in which the open sets in G are the intersections of open sets in L 
with G. 
* L.e., both the isomorphism g — g’ and its inverse are continuous. 
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to D if there exists a real-valued function n(g, d) defined on GD which satisfies 
the following postulates: 

(A) n(g, d) = 0; n(g, d) = 0 for all d e D implies g = 6 where 6 represents 
the zero of G; 

(B) n(vg, d) = | v| ng, d) for any integer v; 

(C) given » > 0, e€ D, there exist 6 > 0, de D such that nig, d) < v, 
n(h, d) < v6 imply n(g + h, e) < vm for all positive integers ». 

(D) n(g, d) 2 n(g, e) whenever d > e. 

From Postulates (A) and (B) it follows that n(@, d) = 0 for all de D and 
that a pseudo-normed group has no elements of finite period. It is clear on the 
basis of Theorem 1 that any l|.t. space is a pseudo-normed group. Every 
pseudo-normed group G is a topological group since the pseudo-norm generates 
a topology for G just as in the case of linear spaces. 

TuroreM 3. The topology of every topological subgroup of a linear topological 
space may be generated by a pseudo-norm. Conversely, every pseudo-normed 
Abelian group has a linear topological extension. 

Proof. The first statement follows at once from Theorem 1. To prove the 
converse we must construct a 1.t. extension L for a given pseudo-normed 
group G. 

We first extend the multiplier domain of G from the integers to the rationals. 
Consider the set of all triples (u, v, g), where » and » are integers, v ~ 0, and 
gis anelement of G. Two triples (x, v, g) and (u’, v’, g’) will be called equivalent 
(written (u, v, g) & (u’, v’, g’)) in case wr’g = p’vg’. It is clear that this relation 
is reflexive, symmetric and transitive. We define the sum of two triples by 
the equation 


(mu, V; g) + (p, T; h) = (1, v7, urg + pvh) 


and the product of a rational number p/7 and a triple by the equation (p/7) 
(u,v, g) = (pu, rv, g). It is easily shown that the equivalence relation = is a 
congruence relation with respect to the operations of addition and of multi- 
plication by rationals. That is, we can define the sum of two equivalence 
classes defined by the relation & as the equivalence class of the sums, and the 
product of a rational and an equivalence class as the equivalence class of the 
products. The set of equivalence classes thus forms an Abelian group G with 
rational multipliers, and it is obvious that the subgroup consisting of those 
elements of G which have representatives of the form (1, 1, g) is isomorphic 
with G. Since G is supposed to be pseudo-normed, we can define a pseudo- 
norm for the supergroup G. Let z be any element of @ and let (xu, v, g) be 
any chosen representative of z. Let d be any element of the strongly partially 
ordered space D associated with G. Put n(z, d) = | u/v| ng, d). It is easily 
verified that n(z, d) is independent of the representative of x selected. On 
making use of Postulates (A)-(D) on the pseudo-norm of G, one can show 
without difficulty that n(z, d) satisfies Postulates (1)—(4) in §1 with the ex- 
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ception that in Postulate (2) the multiplier a is now restricted to be rational. 
Thus G is a pseudo-normed group which obviously contains a subgroup con- 
tinuously isomorphic to G. 

In order to obtain a I.t. extension of G we use the method of Cantor sequences. 
Define the relation between countable sequences out of G as follows: (z,) ~ (yy) 
means that given de D and p > 0 there exists an integer wo such that u > po 
and vy > wo imply n(x, — y,, d) < p. A sequence (z,) will be called funda- 
mental if (7) ~ (a,). Clearly the relation ~ is an equivalence relation among 
fundamental sequences. Let L denote the space whose elements are equiva- 
lence classes of fundamental sequences under the relation ~. Define addition 
of two fundamental sequences by the equation (z,) + (y,) = (% + y,). Since 
the pseudo-norm for @ satisfies Postulate (3) it follows that (x, + y,) is a 
fundamental sequence and that the relation ~ is a congruence relation with 
respect to addition. Hence we may define the sum x + y of any two elements 
z= {(z,)} and y = {(y,)} of L as the equivalence class {(x, + y,)}. Let 
(a,) be a fundamental sequence of rational numbers and (z,) a fundamental 
sequence out of G. Define (a,) o (z,) = (a,2,). To demonstrate both that 
(a,t,) is a fundamental sequence and that ~ is a congruence relation with 
respect to the operation o we need only to prove that if (a,) is equivalent to (a,) 
and (x,) ~ (x,), then (a,2,) ~ (a,2,). Since G satisfies Postulate (3) it follows 
that for any chosen ee D and n > O there is a de D and a p > O such that 
n(iyi,d) < p(t = 1, 2,3) implies n(y: + ye + ys,e) <n. Since (x) is funda- 
mental there exists an integer v such that n(x, —z,,d) <pforuz>v. Now 
let », > v be chosen so large that \ > », and uw > » imply that 


n(ay(x, — z,), d) <p and |a—a,| < min E . |. 
n(x,, ad) 
Since az, — ant, = an(%, — 2.) + (a — a,)(x, — 2) + (ma — @)a,, we 
see that n(ayx, — a, 2, ,¢) < nforkA >»,u >. Areal number according to 
the Cantor construction is an equivalence class of fundamental sequences of ra- 
tionals. Since the relation ~ is a congruence relation with respect to the opera- 
tion o, we may define the product az of an element x = {(x,)} of L by a real num- 
bera = {(a,)} as the element {(a,x,)} of L. With these definitions of addition 
and of multiplication by reals it is clear that Lisa linear space. Let U(d, p) be 
the set of all points z of L for which there exists a representative (2z,) of x and an in- 
teger o such that n(z,,d) < pforall zs > o. Weshall now show that the set U of 
all the sets L’(d, p) satisfies Postulates I-V in $1, and this will prove that L is a L.t. 
space. That U satisfies I, Il and IV follows without difficulty. To show that III 
is fulfilled let x be any point of U(d, p) and let (z,) be a representative of z with 
n(z,, d) < p for r > o. The pseudo-norm for G satisfies Postulate (2) for 
rational multipliers. Hence n(—2z,, d) < p for r > o so that —zr = 
((—2,)} « U(d, p). For any real @ in the interval —1 < a < 1 let (a,) bea 
sequence of rationals converging to a. Then for some w > o it is true that 
tT > w implies that |a,| < 1. Hence for r > w we have n(a,z,, d) < p. 
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Therefore for all a satisfying —1 S a S 1 we have al’(d, p) C U(d, p). To 
demonstrate that V is fulfilled let ze L, e¢ D and n > 0 be given. Let deD 
and 6 > 0 be chosen in accordance with Postulate (3) for pseudo-norms. Let 
(z,) be any representative of z, and choose o so that + > o implies that 
n(x, — x,,d) < 6. Let B be a positive rational < min (1, 6/n(z,, d)). Since 
Bx, = B(x, — 2.) + Ba, and since n(8(z, — 2z.), d) < 6 and n(Bz,,d) < 6 for 
t > ao, it follows from the choice of d and 6 that n(8z,, e) < n for all r >a. 
That is, Bx e U(e, n) where 8 # 0. On taking a = 1/8 Postulate V is seen 
to be satisfied. 

We have shown that L is a |.t. space. Moreover, the pseudo-normed group 
G is clearly continuously isomorphic to the subgroup of L consisting of those 
equivalence classes containing a representative (z,) such that z, = 2; for all r. 
Since the original group G is continuously isomorphic to a subgroup of G, it 
follows that L is a l.t. extension of G. This completes the proof of the theorem. 

The extension L of G is not in general a complete space. If a complete ex- 
tension is desired, the group G should be extended as described in [2] by G. Birk- 
hoff, using directed sets in the place of countable sequences. 
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A MEAN ERGODIC THEOREM 
By NELSON DuNFORD 


In this note we shall give an ergodic theorem of the von Neumann type’ for 
continuous n-parameter uniformly bounded semi-groups of linear transforma- 
tions in a Banach space. The method of proof is an extension of one used by 
F. Riesz’ and K. Yosida’ for the discrete case in one dimension. 

Throughout the note we shall use the following notation and terminology. 


E, is Euclidean n-space, E;, is the set of points a = (a,,--- ,@,) in E, with 
a, 20( =1,---,n),I,isa cube of sider > 0in EZ,. That is, J, is the set 
of points a = (a, ---,a,) € £, for which 

(1) ¢Sa5¢+8r (j =1,---,n), 
where c} (j = 1, --- , m) is an arbitrary real function defined for r > 0. When 


we are dealing with functions defined only on E; , it will be assumed that all 
cubes mentioned are in E;,. We shall sometimes use the symbol V, for the 
volume r" of acubeJ,. A set 7’. (a ¢ E,) of linear transformations in a Banach 
space B is said to form a semi-group in case 


(2) T a+8 ia Tal's ’ a, B € E,. ’ 


and in case 7’, is defined for every a ¢ E, , it is said to form a group if To is the 
identity and equation (2) holds for all a, Be E,. A group (or semi-group) 7. 
is said to be uniformly bounded in case || T. || is bounded in a, and it is said 
to be weakly measurable in case the numerical function #7'.2 is measurable (in 
the sense of Lebesgue) for every xz ¢B and #¢% (the space conjugate to ¥). 
A function x, defined on E, (or E;) is said to be almost separably valued in case 
there is a set Ey of measure zero such that the set of points r7., @a@ Ky, isa 
separable subset of B. If for every cube J, of side r > 0 we have a point 
y(I,) « B, then the set of all y(J,) is said to be weakly compact with respect to 
r— © in case every particular set J, (0 <r < «) of cubes contains a sequence 
I,, with r, > © and y(J,,) converging weakly to an element of 8. A function 
tq defined on a cube is said to be measurable in case it is the limit almost every- 


Received March 18, 1939. The results in this paper were communicated in the note An 
ergodic theorem for n-parameter groups, Proc. Nat. Acad. Sci., vol. 25(1939), pp. 195-196. 

‘J. von Neumann, Proof of the quasi ergodic hypothesis, Proc. Nat. Acad., vol. 18(1932), 
pp. 70-82. 

?F. Riesz, Some mean ergodic theorems, Jour. London Math. Soc., vol. 13(1938), pp. 
274-278. 

*K. Yosida, Mean ergodic theorem in Banach spaces, Proc. Imp. Acad. Tokyo, vol. 
14(1938), pp. 292-294. 
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where of step functions, and it is said to be absolutely integrable in case it is 
measurable and || x, || is integrable in the sense of Lebesgue. Finally, since 
most of what we shall say is true for semi-groups on £;, as well as for groups 
on E,, , we shall use the letter G to stand for either EZ, or E,, ; so that a statement 
concerning a semi-group 7’, on G is meant to cover both cases. 


THEoreM 1. Let B be a Banach space and T . (a ¢ G) a weakly measurable and 
uniformly bounded semi-group of linear transformations in 8. If for each x in ¥ 
the function Tx is almost separably valued, then Tx is absolutely integrable over 
every set E < G of finite measure. Let M be the set of those x eB for which 


(i) — SF | Tx da 
V. J, 


is weakly compact with respect tor —~ «~. Then M is a closed linear manifold 
in B and there is a continuous linear mapping y = Ux of M into itself with 
U~ = U and such that 


(ii) T,Uz = Uz, re M, a eG, 
(iii) Usotas | Tide reM, 


(iv) UM ts the closed linear manifold consisting of those x in B for which Tx = 2 
aeG. 

If the space 8 satisfies any one of the following conditions: 

(a) its unit sphere is weakly compact, 

(b) it is reflexive, 

(c) it has an equivalent uniformly conver norm; 
then MN = B. The limit in (iii) fails to exist for x eM. 


‘ r. 7 - 8 ope » py . 

It is a result of Gelfand’ and Pettis that the measurability of 742 ic a cons- 
quence of its weak measurability together with the fact that it is almost 
separably valued. Since || 7.2 || < C || z||, where C is a bound for || 7’. \), 


the function 7.x is then absolutely integrable over every set E C G of finite 


measure. 


‘ For a discussion of the type of integration of vector valued functions that we use here 
the reader is referred to S. Bochner, Fund. Math., vol. 20(1933), pp. 262-276; and N. Dun- 
ford, Trans. Amer. Math. Soc., vol. 37(1935), pp. 441-453, corrections, ibid., vol. 38(1936), 
pp. 600-601 

5 A space is called reflexive in case to each z B there is an z « B such that 7% = 
for every «%. This terminology is due to E. R. Lorch. 

* A space is said to be uniformly convex in case the length of a chord joining two points 
on the surface of the unit sphere approaches zero as its center point approaches the surface 
of the sphere. This concept is due to J. A. Clarkson. 

7 I. Gelfand, Zur Theorie abstrakter Funktionen, Comptes Rendus de |’Acad. des Sc. de 
URSS, vol. 17(1937), pp. 243-245. 

*B. J. Pettis, On integration in vector spaces, Trans. Amer. Math. Soc., vol. 44(1938), 
pp. 277-304; in particular Theorem 1.1. 
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We shall leave the verification of the properties stated for M until later, and 
fix our attention for the present on a single point z in J. We start with an 
arbitrary fixed set (1) of cubes J,. There is a subsequence y,, of y, such that 
r, > © and Zy,, — Zy for some y e B and every Z¢B. We shall express these 
facts by 
(3) Yrp ww Y- 


From (3) and the continuity of 7. we have 


(4) T a¥r, md Tay, ae G, 
We wish next to show that 

(5) Tay = Y, aeG. 
It will be sufficient in establishing (5) to show that it holds for every vector 
a’ = (0, 0, ---,a;,---,0) with all components zero except the j-th; for if 
a = (q,---,@,), thna =a +.--- +a" and 7, = Tala: ::: Ta. Now? 


l , 1 ; 
Tai Yr, = -! T pai xdg = 4 | T 3x dp, 
lp “Ir, Tp “Ir tai 
so that 


Tai Yrp — Yr = : il Tsx dB — / P52 | 
Pp L#ir +a! Ir, 
= : | f Tx dp — / Tax |, 
rs. A B 


where 
A= (1,, + a’) I, Dry + a’), 
B=I,, — I,,(I,, + @’). 
Hence | B| = |A| = r*'|a;| if rp > |a;|. Since || Tx || S$ C || a||, we 
have then 
1| T aiYr, — ¥, || Z 2-C-|| x ||-| a, |-*5 ifr, > | a; |. 


By (3) and (4), Taiyr, — Yr, w Taiy — y, and so we must have 


| Teiy — y|| S lim inf || Taiyr, — yr, || = 9, 


pe 


and thus equation (5) is established. 


* Here we are using the fact that v. | T sx dB = [Pave dg. See Theorem 2.3 in the 
I 1 


authors paper /ntegration and linear operations, Trans. Amer. Math. Soc., vol. 40(1936), 
pp. 474-494. 
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Now let a > 0 be a fixed positive number and Qi the closed linear manifold 
in $ determined by the points 

T atait — T.2, a eG, 

where a’ is the vector with a in the j-th place and all other components zero. 


Let M, = (Mt, --- , MB], ie., M. is the closed linear manifold in B deter- 
mined by the manifolds Mi (j = 1,---,m). It will next be shown that the 


vector 


(6) y- ce | T,x dB 
J 


belongs to I, for any cube J, of side a. To show this it will suffice to prove 
that iy _ va" | T 3x as| = 0 for every vector Z ¢ 8 which vanishes on M,. 
i™ 


Accordingly let @ be such a functional, then 


(7) ET apaikt = ET Qt (aeG;j = 1,---,n). 


of (7) we have 
1 _— 1ffrl f om at i 
(8) = 2T,2d8 = — { ET 3x dB + ET 32x dB >, 
J r vl, V r a Ja Ry if 
where R? = R*,; and R*,; is the rectangle defined as 
j=1 
¢¢e SaSe,+9r (i = 1, ---,n;¢ #9) 


r 
[2 |+esase tr. 
a 


Since r — H a <a, we have | Ri | s po | R.;| < nar" ", and thus 
a j7=1 
(9) > [. Tx d8|| S$ n-a-C-||2||+||2||-r7. 
r/R, 
Upon combining (i), (3), (8) and (9) with the fact that 


HT -HTO“#  -- 
Vela} ala r Va ; 


= r—l =" ° 
we have zy = V, i ET 3x df, i.e., 


aly _ ve" [ Tax as = 0. 


a 


r . _ R ; 
Let | |e the greatest integer in — and let J, be any cube of side a. In view 
a a 
7) 











(if 
He 
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This shows that the vector (6) belongs to M.. For a given ¢ > 0 we can find 
therefore a point ze t., || z|| < ¢, a finite number of constants b;;, and a 
finite number of vectors a;; « G such that 


(10) y — 7 T,x dp = > Y bidTexsseit — Te;;2] + 2. 
aW“Je i=] 

Upon operating on both sides of (10) by 7’, and averaging over J,, we obtain 

(see (5)) 


1 Lf <. ie 
i (F- a T4208) ay ‘. 


q p 
+2 Xe bag If Ty+a;;+0it dy — I Trseui2dr | 


j=l i=! 


(11) 


Now by a change of variable of integration 


Ty+0;j;+ait dy = / Ty40;; dy, 


Ty Ipt+ai 


and so 
/ T y+; ;+ai t AY = / Ty40;;0 dy - / Ty40;;2 dy -f Ty40;;7 dy, 
Tr Ir Al Bi 
where 
A, = (I, + a’) — 1,(I, + a’), 
B, = I, — I-([p + a’), 


Thus for r > a we have | Ai | = | B, | = ar", and this shows that 


ly- Ef EL mera) ar| 5 2-B-a-llzl-r +40, r >a 
| V, Tr Va Ja 1] 


q 

(if we make use of (11) and the fact that || z|| < «), where B = >> > | bg; |. 
j=1 i=] 

Hence 


lim sup ly — 7! T. (Ff Ty2dB) dr| SC «, 


and since « > 0 was arbitrary, we have 


(12) _ > lim hs T,udy, 


ro Vr Mly 


for every vector u of the form 


(13) u= :. [ T sx dg, 
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where in (13) a is an arbitrary positive number and J, is an arbitrary cube 
of side a 
Now let ¢ > 0 be arbitrary and fix & in the set where [ Tx d8 is differenti- 
J 
able to its integrand.” Let J, be a fixed cube of side a > 0 such that fo e J, 


and 


(14) | aI Tx dp — T's,2 | <e. 
1 Va Ja | 





Then 


( | 5) r “Ir | ! 


[ te i dy = [ T 4-892 dy ||, | 


and by (14) we have 


7 2 
| ! r,(7 ; Ty dB) dy -f Tryzé| 


Thus by (12) and (15) 


lim sup ae | Tornrds|| s Coe 


ro 


Since « > 0 was arbitrary, we have 
: , 1 
(16) y = lim Tz, | Tax dB. 
re V; Tr 


If 7’, were a group instead of a semi-group, we could operate on both sides of 
(16) with the inverse 7's, of 7's, and obtain the desired result. As it is, how- 
ever, a further argument is necessary. Let us suppose that Bo = (bi, --- , bs) 
has been fixed so that 0 S$ b; S$ 1 (i = 1,---,n). Let I; be those points of J, 
whose distance to the boundary of Gis 2 1. Then 


(17) \J,-I;| <r" -(r- 1)’, 


and J! — By C G. From (17) we see that 


(18) Ts2d8 — 7 TerdB| < (1 — (1 —r)"-C- [2]. 
rvl, | 


l 

V, Ir 
10 This is true for almost all values of 8). Actually one can prove easily from the group 

property on the 7’, that this integral is differentiable to its integrand at all interior points 

of G. 








up 
its 
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Now 


A 


y — + |. Tsxa8 < y - Ty |; Tx dB 
ve Ir Fs Ir 


ede asf. T,2d8 — [. Tx dB 
V; Tr I 


r—So 


(19) 


It follows from (16) and (18) that 


(20) lim jy — Ts, . [. Tsxdp\ = 0. 
r—2 ¥; Tr 
If we write A = I? — (I) — &), B = (I, — B) — (I; — Bo), then the 


. —} 
measures | A |, | B| of A, B are at most nr" and 


u T's, if T 32 dg _ [. Taxash 
V, Tr Tr—3o 


(21) 

- 5 nif Tpx dB - | rszaph < 2-n-C- || xfer. 
r A B 

Thus (21) and (20) show that the left side of (19) approaches zero, and this 

fact combined with (18) shows that 


. l 
(22) y = lim | Tx dp. 
ro rly 
Upon placing y = Uz, y, = U,«, we see that U, is a uniformly bounded set 
of continuous linear operators with lim U,z = Uz for x in 2; thus U,x con- 


re 
verges for every xin 9. This proves that M is closed and that U is continuous 
on Mt. Sinee T7,Ux = Uz (see (5)), we have U,Ux = Uz, which equality 
shows that UM C Mand that U? = U. Statement (iv) of the theorem follows 
immediately from the preceding remarks. 

If the unit sphere in 8 is weakly compact, then every bounded set is weakly 
compact and thus Mt = B. If B is reflexive, then its unit sphere is weakly 
compact” and so in this case also M = B. If B has an equivalent uniformly 
convex norm, then & is reflexive,” and so in this case also M = B. This 
completes the proof of Theorem 1. 

An instance of Theorem 1 will now be given. Let E be an abstract set of 
points P and suppose there is a finite completely additive non-negative measure 


" For a proof of this fact see B. J. Pettis, A note on regular Banach spaces, Bull. Amer. 
Math. Soc., vol. 44(1938), pp. 420-428, and V. Gantmakner and V. Smulian, Sur les espaces 
dont la sphére unitaire est faiblement compacte, Comptes Rendus de |’ Acad. des Se. de l’URSS, 
vol. 17(1937), pp. 91-94. 

' For a proof of this see B. J. Pettis, A proof that every uniformly convex space is reflexive, 
this Journal, vol. 5(1939), pp. 249-253; and D. Milman, On some criteria for the regularity 
of spaces of (B), Comptes Rendus de |’Acad. des Sc. de 1’URSS, vol. 20(1938), pp. 243-246. 
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function m defined over a Borel field %(£) of subsets of EZ, and suppose that 
E ¢ X(E) and that A(E) is completed with respect to m; i.e., A(E) contains 
all subsets of sets of measure zero. Then the space L,(F, m) of functions f(P) 
measurable on E and for which 


fil = il \f(P) dm] < © 


is, as is well known, a Banach space whose conjugate space is Lgjcq—1) (2, m) in 
caseg > 1. Forg = 1 the conjugate space is the space of essentially bounded 
and measurable functions with || f || = ess. sup. | f(P) |. 

THEOREM 2. Suppose that T, is a uniformly bounded group of linear trans- 

formations in L,(E, m) (q > 1) generated by a group S(a, P) of point transforma- 
lions of E into itself, 7.., 
(i) S(0, P) = P, S(a + B, P) = S(a, S(B, P)), a,BeEH,, PE, 
(ii) Tx = 2[S(a, -)], aeH,,xeL,(E, m). 
Suppose further that for each x in L,(E, m) the function x|S(a, P)| ts measurable 
in the product space E, X E. Then there is a continuous linear projection y = Uz 
on L,(E, m) such that 


(iii) “U ! aeE,,2¢€L,(E, m), 


(iv) Ux = lim V>" i z[S(a, -)] da, the limit being in the norm of L,(E, m). 


re I, 
(v) The range UL,(E, m) of U is the closed linear manifold in L,(E, m) consisting 
of those x for which a ¢ E, implies x|[S(a, P)| = x(P) for almost all P ¢ E. 
To prove Theorem 2 it will be sufficient to show that (a) the group is weakly 
measurable, (b) for each z in L,(E, m) the function 7.x is almost separably 


valued, and 
(c) / T.xda = / 2(S(a, -)] da. 
Ty Tr 


These three facts, as we shall see, are all immediate consequences of the meas- 
urability of z[S(a, P)| in the product space EZ, X E. First we shall make 
another observation. Let x be the characteristic function of the measurable 
set e. By the group property on S(a, P) we have 2[S(— a, P)| = y(P), where 
y is the characteristic function of S(a, e). Thus, since it was assumed that 
7.2 € L,(E, m), we see that S(a@, e) is measurable if ¢ is measurable. Let z, 7 
be the characteristic functions of the measurable sets e, @, respectively. Then 


4T 2 = [ Asc, P)|dm = mléS(—a, e)], 


and thus by the theorem of Fubini 27,2 is measurable. Since the characteristie 
functions form a fundamental set in L, as well as in L, (even when q = 1), 
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we see that (a) is satisfied. It might be mentioned at this point that (a) is 
entirely equivalent to the assertion that m[@S(a, e)] is measurable in a@ for 
every pair e, é of measurable subsets of E. Since E, is the sum of a denumerable 
number of cubes, and since the characteristic functions of measurable sets form 
a fundamental set in L,(E, m), it will suffice in proving (b) to show that for a 
characteristic function z, 7.x is almost separably valued when considered as 
defined only onacube J. Let f(a, P) = 2[S(a, P)], where z is the characteristic 
function of a measurable set e C E. Thus f is the characteristic function of a 
measurable set in the product space E, X E. There will be therefore a sequence 
f.(a, P) of characteristic functions each composed of a finite sum of functions 
of the form ¢(a)¥(P), where ¢, ¥ are characteristic functions of measurable sets 
in EZ, , E, respectively, and such that 


/ da | fa(a, P) — fla, P)|*dm = / da / Ifa(a, P) — fla, P)|dm — 0. 

I E I gE 

There is then a sequence n; — ~ such that 

lim i \fn,(a, P) — fla, P)|\*dm = 0 almost everywhere on J. 
gE 


1-2 


Thus the function 7,2 = f(a, -) has for almost all @ in J its values in the closure 
of the set in L,(£, m) determined by the set of functional values of f,(a@, - ) 
(n= 1,2,---). This proves (b). Finally let ¢ = #(P) bea point in L,(E, m), 
so that 


z | T.xda / 2T,.2da = / da | 3(P)2z[S(a, P)| dm 
Ir Ir Tr Ek 


[ F(P) dm [ a[S(a, P)] da. 


Then (c) is established. 
In order to apply Theorem 1 to the case 8 = L(E, m), we need the following 


THEeoreM 3. A set F in L(E, m) is weakly compact if and only if 


(i) i f(P) |dm < C, feP, 
kK 


(ii) lim [1 dm = 0 uniformly for fe F. 


m(e)=0 ve 


Any weakly compact set in a Banach space is bounded so that (i) holds if F 
is weakly compact. A negation of (ii) yields a sequence e, ¢€ % with m(e,) > 0 


and a sequence f,¢F such that [ sce) am > « > 0. By assumption 


there is a subsequence {g,} of {f,} such that [ one) dm converges for every 


© 
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e e , and thus by the well-known Hahn-Saks theorem” the integrals / gn(P) dm 


are equi-absolutely continuous. This is the desired contradiction. 

To prove the converse we assume (i) and (ii). Let {G;} be a basis for the 
open sets of real numbers and let |f;} be an arbitrary sequence of points f; ¢ F. 
Let E,; be the set in %& where f;(P) ¢G; , and let % be the closure in & (under 
the metric (E’, E’’) = m(E’ — E"”) + m(E” — E’)) of the field determined 
by the double sequence {£;;}. Then % is a Borel field and as a metric space 
is complete and separable. The space Lo(H#, m) of % measurable functions 
which are summable on E with respect to m is a closed linear manifold in 
L(E, m). Let {E£;} be dense in the metric space %, and using the diago- 
nal process of G. Cantor, pick a subsequence {gi} of {fi} such that the 
lim | gi(P) dm exists for each 7 = 1, 2,---. Now (ii) implies the uniform 

Bj 


‘2 


(with respect to f « F) continuity of [s@) dm at any point e in &. Thus 


we conclude that the lim | g:i(P) dm exists for every Eye %. This limit, 


1-0 Eo 
being absolutely continuous with respect to m, determines (by the theorem 
of Nikodym)“ an % measurable function g which is summable with respect 
to m and such that 


(23) lim / gi(P) dm = / g(P) dm, €9 € Ap. 


Let My be the conjugate of Lo(E, m), i.e., Mo is the space of bounded %%> measur- 
able functions. Let Mo be the set of those ¢ in My which assume only a finite 
number of values. We see by (23) that 


(24) lim / $(P)g(P) dm = i $(P)g(P) dm, 

iw 4E E 
for every @ in M?. Since M¢ is dense in Mo we have, using (i), the fact that 
(24) holds for every@¢ = My. Nowevery¢ «eM = L(E, m) determines uniquely 
a do € My such that 


(25) [ omar) dm = / ¢o(P)g(P) dm, g ¢ Lo(E, m). 
k 4 

Thus since g; , g are in Lo(E, m) we see from (24) and (25) that 
/ ¢(P)gi(P) dm > / ¢(P)g(P) dm, oe M, 
K zg 


which shows that the set F is weakly compact in L(E, m). 


13S. Saks, Addition lo the note on some functionals, Trans. Amer. Math. Soc., vol. 35(1933), 
p. 987. 

14 OQ. Nikodym, Sur une généralisation des intégrales de M. J. Radon, Fund. Math., vol. 
15(1930), pp. 131-179. 
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Simple examples show that (i) is not a consequence of (ii) alone. If, how- 


n(e) 
ever, E is representable as a sum > E; of sets with m(E,) < «, then (ii) im- 
i=l 


plies (i). This is the case when E is a bounded set in Euclidean space and m 
is Lebesgue measure. 

THEOREM 4. Let S(a, P) be a group of point transformations satisfying 
(i) S(O, P) = P, S(a + B, P) = S(a, S(, P)), a,BeE,, Pek. 
(ii) If ee A, so is S(a, e) € A and m[S(a, e)] = mfe]. 
(iii) For each x ¢ L(E, m) the function x[S(a, P)| is measurable in the product 
space E, X E. 
Then for every a ¢ E, and x ¢ L(E, m) the function Tx = x|S(a, - )| is in L(E, m) 
and 


(iv) i 2(S(a, P)| dm = [ x(P) dm, ee, ae E,, re L(E, m). 
e S(a,e) 

(v) || Tax |j = || Il. 

(vi) There is a continuous linear projection y = Uz of L(E, m) into itself such 

that 

(vii) T,Uz = Uz, reL(E,m),aeE,, 


(viii) Ux = lim v;" | a[|S(a, - )] da, the limit being in the norm of L(E, m), 


reo Tr 


1.€., 
lim / y(P) - : / ziS(a, P)| da dm = 0, y = Uz. 
ro “EK r “I, 


(ix) The range UL(E, m) of U is the closed linear manifold in L(E, m) consisting 
of those x for which a « E, implies x[S(a, P)| = x(P) for almost all P «¢ E. 


An elementary calculation based on (i) and (ii) shows that (iv) and (v) hold 
for % measurable functions which assume only a finite number of values. If 
|Z} is a sequence of such functions approaching z in the mean as well as almost 
everywhere, then x,{S(a, P)] — z[S(a, P)] almost everywhere and the integrals 


[ =18(@, P)| dm = [ x,(P) dm 
e S(a,e) 


are equi-absolutely continuous. “Thus z[.S(a, P)] is summable in P and equa- 
tions (iv) and (v) hold for every z in L(E, m). The conditions (iv) and (v) 
show, when combined with Theorem 3, that for a given x ¢« L(E, m) the set 
Tt (ae E,) is weakly compact. It then follows immediately that the set 


r—1 , : . 
v, / Tt da is weakly compact; for 
Ir 
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[am |v. . [ z[S(a, P)] da] = V>" [ da / a[S(a, P)] dm 


= vy; f da [ x(P) dm < sup [ x(P) dm. 
Ty S(a,e) a S(a,e) 


Thus Theorem 4 follows from Theorem 1. 


THeoreM 5. Let the group T, (a¢€E,) satisfy the conditions of Theorem 1. 
Then for each x in B the function Tx is uniformly continuous on E, . 

In the case of semi-groups on EF, we can only state a sort of one-sided con- 
tinuity in the interior of E,, , i.e., Tat — Tx if a — B in such a way that its 
components are greater than those of 8. Theorem 5 may be proved by the 
same method we have used for one parameter groups” and so we omit the 
details. The uniform continuity stated here follows from the uniform bounded- 


ness of the group. 


YALE UNIVERSITY. 


‘* N. Dunford, On one parameter groups of linear transformations, Annals of Math., 
vol. 39(1938), pp. 569-573. 





THE EXISTENCE OF CERTAIN TRANSFORMATIONS 
By G. T. WHYBURN 


The general existence problem for a specified type of transformation of one 
given compact set onto another is of long standing and has received numerous 
contributions over a considerable span of years. For example, a classical result 
of Hahn and Mazurkiewicz yields the conclusion that any compact locally 
connected continuum can be mapped continuously onto any other one but cannot 
be so mapped onto a non-locally-connected continuum. 

In this paper the question of the mappability of a compact locally connected 
continuum M onto an interval by particular sorts of continuous transformations 
will be considered. This is, of course, the same thing as considering the defina- 
bility of particular kinds of continuous, real-valued functions on M. In this 
connection the reader is referred to closely related papers by Cech,’ Mazurkie- 
wiez,’ Aitchison,’ C. Pauc,* Kuratowski,’ and the author.° 

We consider monotone, non-alternating, interior and light transformations. 
If A and B are compact continua, a continuous transformation 7(A) = B is (1) 
monotone’ provided the inverse set 7 '(b) of each point b in B is connected, (2) 
non-alternating® if for any two points x and y of B, T”'(x) does not separate any 
two points of 7” '(y) in A, (3) interior’ provided the image of every set open in A 


is open in B, and (4) light provided that for each point b in B, T”'(b) is totally 
disconnected (or of dimension 0). 

The principal results will be found in §§2 and 4. In §2 it is shown that a 
compact locally connected continuum M can be mapped onto an interval by a 
non-alternating interior transformation f if and only if the cyclic elements of M 
are arranged into a cyclic chain. In §4 it is shown that in case M is 1-dimen- 
sional, f can in addition be chosen as a light transformation. 


— . — . . . 
|. Lemmas on joining and subdivision. If a and 6 are points of a locally 
connected continuum M and K is the set of all points separating a and b in M, 


Received March 22, 1939. 

‘ Fundamenta Mathematicae, vol. 18(1932), p. 85. 

? Ibid., p. 88. 

*>Comptes Rendus de la Société des Sciences et des Lettres de Varsovie, vol. 27(1934). 

‘Comptes Rendus, Paris, vol. 202(1939), p. 489. 

> Fundamenta Mathematicae, vol. 80(1988), p. 17. 

* American Journal of Mathematics, vol. 55(1933), p. 131. 

lL. Moore, Transactions of the American Mathematical Society, vol. 27(1925), 

p. 416; C. B. Morrey, American Journal of Mathematies, vol, 57(1935), p. 17; also reference 
in footnote 8 

*See G. T. Whyburn, American Journal of Mathematics, vol. 56(1934), pp. 394-402 

’See Stoilow, Annales Scientifiques de Ecole Normale Supérieure, vol, 63(1928), 
pp. 347 382. 
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the cyclic chain C(a, b) in M from a to b is the set consisting of K + a + b plus 
all cyclic elements of M intersecting K + a + 6 in exactly two points. The 
set C(a, b) is known also to consist of all simple ares in M of the form azb. In 
this paper we shall have occasion frequently to impose the condition that M be 
identical with some cyclic chain C(a, b) in M. 

(1.1) Lemma. Let M be a compact locally connected continuum such that M = 
C(a, b) fora, be M. Let A Da, B Db be continua in M not separating M. 
Then for any x «M—(A + B), there exists an irreducible continuum AxB in M 
between A and B containing x which is locaily an arc at all points of ArB—(A + B). 

Proof. Let G be the upper semi-continuous decomposition of M into the 
sets A, B and points of M—(A + B). Let T(M) = M’ be the associated 
(monotone) transformation. Let T7(A) = a’, T(B) = b’. Then M’ is a locally 
connected continuum; and since a’ and b’ are non-cut points of M’ and T is 
monotone, it readily follows that M’ = C(a’, b’). Hence M’ contains an are 
a’xb’ and we have only to set 


AzB = T™'(a’xb’). 
Let the locally connected continuum M be a cyclic chain C(a, b). By a 


subdivision « of M will be meant a finite, linearly ordered, set of disjoint 
irreducible cuttings of M between a and b. 


a= X,,X1, X93, Xs, --: , Xn, Xn = J, 
where X; (1 S 7 S n) cuts M into two connected sets M,(X;), Ms(X;) so that 
i—l n 
M,(X,) Da+ > X;and M,(X;) D>b+ D0 X;. Theset 
j=l 


i+1 
Xi + X; + M,(X;)-Mo( Xi) ”_ I; 
will be called the interval from X;_; to X; (a = Xo, b = Xni1). A set of the 
form M,(X,)-M,(X,_:) will be called an open interval of c. 


(1.2) Lemma. Given any subdivision oo and any « > 0, there exists a subdivision 
a containing oo and such that if X ; is any element of o, 


V(X) DT + Tin. 


Proof. For convenience we will suppose the metric in M is the so-called 
‘ . . . . 10 a . 
‘relative distance’”’ of Mazurkiewicz. Now let e = fe. Since the sets V,(z), 
x ¢ M, are open (and connected) and cover M, we can select a finite number of 
them 


Vi = V(t), Ve = Ve(te), ---, Vm = Veltm) 
which cover M. 


1° Fundamenta Mathematicae, vol. 1(1920), p. 167. In this metric, spherical neighbor- 
hoods of points or of connected sets are connected. Note: For any set X and any r > 0, 
V,(X) denotes the set of all points at a distance < rfrom X. 
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Now consider V,. If V; > 6b, we need go no further. Suppose V, does not 
contain b. Let X{,:be the first element of oo such that M,(Xis1) DVi. Nowif 
V2.(21)-Xi41 ¥ 0, we need go no further. If this is not so we proceed as follows. 
Applying (1.1), using the continua a = M,(X?) + Xj and 8B = M,(Xiy1) + 
X?.,, we obtain a continuum az which is locally an are at z, where ze V,. 
Let y be the last point of V2-(z1) on az in the order a, 8, and set 


A = V, + az (of az), B = yB (of az). 


Then A and B are disjoint continua; accordingly" there exists a set X’ which 
separates M irreducibly between A and B and indeed such that M — X’ has 
just two components each bounded by X’. Clearly X’ is on the “b side” of 
V; (ie., M.(X’) D V;,), and X’- V2.(21) ¥ 0 since A-V2(21) 0 # B-V2-(x1). 
In exactly the same manner we construct a set X” on the “a side”’ of V; such that 
V2(21)-X" + 0. Now let us add X’ and X” to oo and call o the resulting 
subdivision. 

Next let us proceed with V2 and a; in exactly the same way as we did above 
with V; and oo and obtain a subdivision oz containing elements X’ and X” each 
intersecting V2,(x2) and such that V2 is in the interval of o2 from X” to X’, i.e., 


V2 © My(X”)-M,(X’). 


Continuing in this manner to V,, , we obtain a subdivision ¢,, , which we call o, 
having the property that for any 7 S m there are elements X; , X; of o inter- 
secting V2,(x;) and such that 


Vi © M,(X;)-M.(X;x). 


Since e = }e and the sets [V;] cover M, clearly this is equivalent to our lemma. 


2. Non-alternating interior transformations into an interval. 


(2.1) THeorem. [f the locally connected continuum M is a cyclic chain C(a, b), 
there exists a non-alternating interior mapping f of M onto the interval (0, 1) such 
that f-'(0) = a, f*(1) = b. 

Proof. By (1.2) we can set up an infinite monotone sequence of subdivisions 
01, og, --- of M such that for each n, o, satisfies the conclusions of (1.2) for 
e=n'. 

Let G denote the collection of all the elements in all of the subdivisions a; . 
Then G is a non-separated collection (see footnote 11) of cuttings of M between a 
and b. Furthermore, if X «Gand pe M — X, by Lemma (1.2) there exists a o, 
containing X and such that neither interval of c, abutting on X contains p. 
Thus there exists an element Y of G (in fact of ¢,) which separates X and pin M. 


‘t See the author’s paper in the Transactions of the American Mathematical Society, 
vol. 33(1931), pp. 444-454. A collection G of disjoint subsets of a connected set M is called 
non-separated provided no element of G separates in M two points lying on another element 
of G. 
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Hence by a result previously established,” if for each z eM — G’ (G’ = >> X) 
we let Z be z together with all p « M which cannot be separated in M from z 
by any single element of G, then Z is closed; and if we call Gp the collection 
obtained by adding to G all such sets Z, then G is upper semicontinuous. Fur- 
thermore, every element of Gp except the ones containing a and b must separate a 
and bin M. For clearly each new element Z can be represented in the form 


Z= IIT 7., 
1 


where J, is the interval of o, containing z. 

Hence the hyperspace of Gp is a simple are which we may suppose is the 
interval (0,1). Let f be the associated transformation of M onto (0, 1). Since” 
the collection Go is non-separated, it follows that f is non-alternating. To see 
that f is also interior, let U’ be any open set in M and let re UU. There exists a 
region R,x CRCU. If «eX €G, then since R intersects each of the regions 
M,(X) and M,(X) which map into the open intervals [a, f(X)] and [f(X), }] 
respectively, it follows that f(2) and hence f(U) contains an open interval about 
S(X) =f(x). If x belongs to no element of G, it follows by the way Lemma (1.2) 
gave the sets of G that there exist elements X ; and X, of ao, such that X;-R # 
0 # X,-R and z belongs to the interval from X; to X,. But this gives f(R) D 
[f(X;), f(X%)] D f(x), so that f(U’) contains an open interval about f(z), and our 
proof is complete. 

It will be noted that the transformation f is so defined that for any zx with 
0 <x < 1,f‘(x) separates M irreducibly between a and b into exactly two 
regions. Thus we have 

(2.11) If M ts unicoherent, f will be monotone. 


The property of f just mentioned actually is a necessary consequence of the 
properties sought in Theorem (2.1) as will now be shown. 


(2.2) Turorem. Let f(x) be a non-alternating interior mapping of a locally 
connected continuum M onto the interval (0, 1) and let A = f'(0), B = f*(1). 
Then for each y with 0 < y < 1, f(y) separates M irreducibly between A and B 
into just two components. Neither A nor B separates M and if ae A, be B, M is 
identical with the cyclic chain C(a, b). 


Proof. By known properties of non-alternating transformations (see footnote 
8) it follows that f-'(y) separates M into just two components R, and R, con- 
taining A and B, respectively. Furthermore R, maps onto (0, y)—y and R, 
onto (y, 1)—y. Thus since f is interior, f(y) must be the boundary both of 
R, and of R,. Hence f~'(y) separates M irreducibly. 

Since neither 0 nor 1 separates (0, 1) and f is non-alternating, it follows (see 
footnote 8) that neither A nor B separates M. 

12 [bid., pp. 451-452. The equivalence used here of the property of a transformation 


being non-alternating and the property of its associated decomposition being non-separated 
is perhaps worthy of special note. 
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To show that M = C(a, b), let us suppose on the contrary that there is a 
component R of M—C(a,b). The boundary of R isa single point z and x cannot 
belong to A or to B since neither of these sets separates M. Thus z ef ‘(y) for 
some y with 0 < y < 1. But this is impossible since f ‘(y) would separate M 
into at least three components. Thus M = C(a, b), and our proof is complete. 

Combining (2.1) and (2.2), we have 


(2.3) THrorem. In order that a compact locally connected continuum M be 
mappable onto the interval (0, 1) by a non-alternating interior transformation f so 
that f(a) = 0, f(b) = 1, a,b e M, itis necessary and sufficient that M be identical 
with the cyclic chain C(a, 6). 


3. Lemmas on separation. 


(3.1) Lemma. Let M be a locally connected continuum, let f(M) = (0, 1) be 
continuous anda ef '(0),bef ‘(1). Foreachx,0 < x < 1, wecan choose a subset 
X of f-'(x) such that the collection |X] is a non-separated collection of cuttings of M 
between a and b. 

Proof. For each x, 0 < x < 1, let R,(x) be the component of M — f'(z) 
containing a, let R,(x) be the component of M — R,(z) containing b and let 
X = F{R,(z)]."° Then clearly 

X = F[R,(2)]-F[Rs(z)] C f(z). 
Further, if 0 < 2; < 22 < 1, we have 
X\ ‘.. R,(x1) je R,(22), Xe . Ry(xe) = R,(2x1). 


° a. 14 
Hence the collection [X] is non-separated. 


(3.2) Lemma. If a and b are points of a 1-dimensional locally connected 
continuum M and F is any closed 0-dimensional subset of M, there exists a 0- 
dimensional set X C M — F which separates M irreducibly between a and b into 
just two components. 


Proof. There exists (see footnote 6) a light transformation f(z) of M onto 
(0, 1) so that f(a) = 0, f(b) = land f(F) Cf(a) + f(b). Hence, applying (3.1), 
we obtain an uncountable, non-separated collection of 0-dimensional cuttings X 
of M between a and b no one of which intersects F. Thus” some one of these 
sets (in fact all but a countable number of them) separates M irreducibly between 
aand b into just two components. 

The results just established lead at once to 


(3.3) THeorem. If A and B are disjoint subcontinua of a 1-dimensional 
locally connected continuum M and F is any 0-dimensional closed subset of M, 


18 For any open set R, F(R) denotes the boundary of R, i.e., the set R — R. 

‘4 Compare this lemma and its proof with that given by the author on pp. 452-453 of the 
paper cited in footnote 11. 

16 See p. 450 of the paper cited in footnote 11. 
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there exists a 0-dimensional set X C M — F which separates M irreducibly between 
A and B into just two components. 

Proof. Let a denote the continuum obtained by adding to A all components 
of M — A except the one containing B; and similarly, let 8 = B plus all com- 
ponents of M — B except the one containing A. Let M’ be the hyperspace of 
the decomposition of M into the sets a, 8 and individual points of M—(a + 8) 
and let 7(M) = M’ be the associated monotone transformation. Let 7(a) = a, 
T(8) = b. Then M’ is 1-dimensional and locally connected; and, applying 
(3.2) to M’, we get a 0-dimensional set X' C M’ — T(F) which separates M’ 
irreducibly between a and b into just two components. Then if we set X = 
T‘(X’), it follows by known properties of monotone transformations that X 
satisfies our theorem. 


4. Non-alternating interior light transformations into an interval. Let M 
be a 1-dimensional locally connected continuum which is identical with a cyclic 
chain C(a, b). A subdivision ¢ of M will be called 0-dimensional if each element 
of it is 0-dimensional. A subdivision o’ will be called a refinement of o provided 
it is obtained from o by inserting additional elements. It will be understood 
that any refinement of a 0-dimensional subdivision is itself 0-dimensional. 
Actually in this section we shall deal only with 0-dimensional subdivisions. 


(4.1) Lemma. Given a 0-dimensional subdivision o of M and any « > 0, there 
exists a refinement o' of o such that each component of an open interval in o’ is of 
diameter < e. 


Proof. Let K be the sum of all elements of o and let d = ~oe. By well-known 
decomposition theorems for 1-dimensional locally connected continua,” there 


exists a 0-dimensional closed set F in M — K and a decomposition M = > M 
1 


such that for each 7, M; is a locally connected continuum of diameter < d and for 
i #j,M;-M; CF. 

It follows at once from (3.1) and the argument given under (1.2) that there 
exists a refinement o; of o such that if K, denotes the sum of all elements of o;, 
then K,-F = Oand for eachi S n, K,-M; # 0. 

Now let M; and M; be any two of the sets [M;] such that p(M;, M,) > 4 
It will be shown that 

(*) there exists a refinement ox of o; such that no component of an open interval of 
o; intersects both M; and M; . 

To this end, let J = XY be any closed interval of o,; such that the open 
interval E = J — (X + FY) intersects both M; and M;. Let F; and F; be the 
boundaries of M; and M,, respectively; i.e., F; = F-M;,F; = F-M;. Since 
the closure of any component of M;-E (or M;-E) must intersect either both 
X and Y or both X + Y and F; (or F;) and K,-F = 0, it follows that M;-E and 
M ;-E have finite numbers, say n,; and n; respectively, of components. 


16 See, for example, Menger’s Kurventheorie, p. 191. Note also references there given to 
I £ ’ 
Urysohn and Vanek. 
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Now for any component R of M;-E whose closure intersects both X and Y, 
let V be a neighborhood of Y so that V-F; = 0. Then R contains a connected 
subset S such that S.-Y = Oand S DR.(M — V). Thus any component of E 
which intersects R but is not contained wholly in R must intersect S. Let us 
replace each such component RF of M,-E by the corresponding set S and call S; 
the resulting set. Similarly let us obtain a set S; in M;-Z. Then S; and S; 
are subsets of M; and M; having n; and n; components respectively and such 
that (1) any component of # which intersects both M; and M; must intersect 
both S; and S; and (2) the closure of each component of S; or of S; intersects 
exactly one of the sets X and Y. 

Let us add to M,(X) + X the closure of all components of S; and of S; whose 
closures intersect X and call A the continuum thus obtained. Similarly, let 
B = M.(Y) + Y plus all components of S; and of S; whose closures intersect Y. 
Applying (3.3) to A and B, we obtain a 0-dimensional set Z irreducibly separating 
M between A and B into just two components and not intersecting F. 

Now there are, say, m; components of S; and m; components of S; in the 
interval XZ and the closure of each of these intersects X but not Z; and there are 
n; — m; components of S; and n; — m; components of S; in ZY and the closure 
of each of these intersects Y but not Z. 

Consider the interval XZ. Since M,(X) is connected, there is some com- 
ponent P of S; or of S;, say of S;, which can be joined by an are a in M,(X) 
toa point of Z such that a intersects no other component of S; or of S;. Now, 
just as we replaced R by S above, we can replace P by a connected subset Q of P 
with @-X = 0 which also intersects a and has the property that any component of 
XZ—(X + Z) which intersects both S; and P also intersects Q. Let B, denote 
the continuum M,(Z) + Z + a+ Q and let A; be the continuum M,(X) + X + 
8;-— P-XZ+8;-XZ. Applying (3.3) to A; and B, , we obtain a 0-dimensional 
set W, irreducibly separating M between A, and B, into just two components and 
not intersecting F. Further, the interval XW, has the property that any com- 
ponent of X W, which intersects both M; and M ; must intersect one of the m; — 1 
components of S; which are in X W, , and the closure of each of these components 
as well as of each component of S; in XW, intersects X but not W,. Repeating 
this argument, we get a set Wz between X and W, which separates off either a 
second component of S; in XW, or one of S;in XW, , and soon. After at most 
m: + m; steps we obtain a subdivision X, W., W.i,---, We, Wi, Z of XZ 
so that no component of an open interval in this subdivision can intersect both 
M;and M;. 

In exactly the same manner we can subdivide the interval ZY. Hence we have 
shown that each interval XY in o; can be subdivided so that no component of an 
open interval in the refinement can intersect both M; and M;. Let us so sub- 
divide each interval in o; and call o;, the resulting refinement of o,;. Then (*) 
is satisfied. 

Now applying (*) successively to all pairs M; , M; satisfying p(M;, M;) > 4e, 
we obtain eventually a refinement o’ of ¢ which must satisfy our lemma. For if 
there were a component R of an open interval of o’ of diameter > ¢, we could 
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find points p, q « R with p(p,q) > 2 > 4¢ + 2d. Henceif M; > p,M; D4q, we 
have p(M;, M;) > p(p, gq) — 2d > }e. Accordingly no component of an open 
interval of o’ could intersect both M; and M;, and we have a contradiction. 


(4.2) THeorem. If the 1-dimensional locally connected continuum M is 
identical with its cyclic chain C(a, b), a, b « M, there exists a non-alternating light 
interior transformation f(x) of M onto the interval (0, 1) so that f-'(0) = a, f-'(1) =b. 


Proof. We note first that by virtue of (3.3), under the conditions of our 
theorem, we can obtain (1.2) with the stronger conclusion obtained by sub- 
stituting “‘0-dimensional subdivision o”’ for the words “subdivision o’’. 

Thus by using (4.1) in addition, we can set up a monotone sequence of 0-dimen- 
sional subdivision o; , a2, --- of M such that, for each n, o, satisfies the con- 
clusion of both (1.2) and (4.1) for e = n™' 

Now by exactly the argument given in the proof of (2.1), beginning with the 
second paragraph, we define our function f(z) and show that it satisfies all our 
conclusions except the property of being light. But that f is light results at once 
from (4.1) and the fact that each ¢, is 0-dimensional. For if 0 < y < 1, the 
set f '(y) either belongs to o, for some n (which gives dim f '(y) = 0) or else it is 
of the form 


rq) =I z,, 


where £, is an open interval of ¢,. In the latter case, since each component of 
E,, is of diameter < n‘, f-'(y) must be 0-dimensional. 

The theorem just proved together with a known result” on light interior trans- 
formations gives the following interesting additional conclusion. 


For any point x of M there exists a simple arc arb in M which maps topologically 
onto the interval (0, 1) under f. 
Thus f(x) gives a sort of “are development 


(4.2). 


5. Interior transformations of graphs and dendrites onto an interval. It is 
known” that any interior transformation on a graph or a dendrite is necessarily 
light. Also it follows from (2.2) that the only dendrite which can be mapped 
onto an interval by a non-alternating interior transformation is the simple are. 
However, if we omit the non-alternating requirement on the transformation, we 
find that a more inclusive but not all-inclusive class of dendrites can be mapped 
onto an interval. We begin with 


9718 


to M under the conditions of 


(5.1) Any connected graph A can be mapped interiorly onto an interval. 


For let X be the set of vertices in a subdivision of A and let Y be a set obtained 
by selecting one point from each component of A — X. Then if we define 


17 See my paper in this Journal, vol. 3(1937). Note also references there given to Stoilow 


and Montgomery. 
18 Compare this with results recently announced by C. Pauc. See footnote 4. 
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f(x) = 0, if re X; 
f(z) 1, ifreY 


and let f map the closure of each component of A—(X + Y) topologically onto 
(0, 1) preserving end points, clearly f maps A interiorly onto (0, 1). 

(5.2) THeorem. In order that a dendrite A be mappable interiorly onto an 
interval it is necessary and sufficient that (a) no point of A cut A into infinitely many 
components, and (b) if H is the set of all end points of A, H — H be a finite set. 


Proof. To show the conditions necessary, let f(A) = (0, 1) be interior and 
let K =f '(0) +f (1). Then since every end point of A must (see footnote 17) 
map into an end point of (0,1), we have K > H; and by a previous result of the 
author’ it follows that K — H is a finite set. Then since K is closed, it results 
that (b) is necessary. Now suppose, contrary to (a), that a point z of A cuts A 
into infinitely many components. Then since z is a limit point of H, we have 
reK,i.e., f(x) = Oor f(x) = 1; but since there is only one component of (0, 1) — 
f(x), there can be only a finite number of components of A — f 'f(x). Clearly 
this is a contradiction. 

To show the conditions sufficient, let us suppose they are satisfied in A. Since 
AH is connected and H —H is finite and (a) is satisfied, it follows that A—H 
has just a finite number, say n, of components. Now let N be a set containing 
just one point of order 2 from each component of A—H. Then A—H—N 
has just 2n components R, , R2, --- , Re, each of which has just one limit point 
in N and one or more limit points in 7. 

Let R be any one of the sets R;, let z = R-N and Y = R-H. Decompose R 
into the set Y and the individual points of R — Y, let R’ be the hyperspace of 
this decomposition and let T,(R) = R’ be the associated transformation. Let 
T(z) = x’, T(Y) = y’. Since there can be no end point of R& in R, it readily 
follows that R’ is identical with the cyclic chain C(z’, y’) in R’. Hence by (2.1) 
there exists a non-alternating interior transformation 72(R’) = (0, 1) such that 
T;'(0) = 2, Tz'(1) = y’. It readily follows that the transformation 727’, is 
interior. 

Let us now define f(z) as follows: 


(0 for re N; 


f(x) = 41 for xe H; 


\T:T\(x) forzeR:, 


where 7’; and 73 are defined for R; as 7; and 7; were defined above for R. Then 
since each 737; is interior, and since any open set in A intersecting N or H 
contains a connected open subset of some R; which maps into a non-degenerate 
interval abutting on 0 or 1 respectively, it follows that f(z) is interior. 


THE UNIVERSITY OF VIRGINIA. 


'? See this Journal, vol. 4(1938), p. 609. 





THE WEIERSTRASS CONDITION FOR MULTIPLE INTEGRAL 
VARIATION PROBLEMS 


By LAWRENCE M. GRAVES 


Consider the problem of minimizing a multiple integral 
I= [s2, p) dt = / see | 40, x, p) at -++ dt, 
R . 


in a class of admissible manifolds in (t, x)-space with equations in the form 
1 7 . 
x = x(t), where xz = (x, ---, 2"), t = (4, --- , tn), and p denotes the matrix 


(p2) = (dx'/dt.). The Weierstrass E-function has the form 
Kt, Zz, Pp, P) - f(t, z, P) — f(t Z, Pp) so (Po - Padfi (t, z, P), 


where f; = af/ap, and the usual summation convention is used. We suppose 
that f and its partial derivatives f7 are continuous in a certain region S of (t, z)- 
space for all p.' For the class of admissible manifolds we take all manifolds 
x = x(t) lying in S, of class D’ on the fixed region F of t-space, and having a fixed 
boundary over the boundary of R. Then a necessary condition for a minimum 
of I is that E(t, x, p, P) = O for all (t, x, p) on the minimizing manifold and for all P 
such that the matrix P — p = (Pa — Pa) has rank one. 

A very brief and elementary proof for the condition is given in §1 below.” 
The proof is similar to one given by the author for simple integral problems.’ 
The Weierstrass condition as stated obviously applies also to problems in para- 
metric form. In §2 the condition is transformed so as to be expressed entirely 
in terms of the n-rowed minors of the matrices p and P.* 


1. Proof of the Weierstrass condition. Let the minimizing manifold Mo have 
equations z' = ¢'(t), and suppose that the partial derivatives 4¢'/dt, are con- 
tinuous near ¢ = f. Let these derivatives for t = i be denoted by pi. If the 


matrix Ap = P — p has rank one, it may be represented in the form Ap, = 


Received March 27, 1939. 

1 Obviously we could also consider restrictions on the admissible values of the p, . 

2 For more general classes of admissible manifolds McShane has shown that E 2 0 
almost everywhere. See Annals of Mathematics, vol. 32(1931), pp. 578-590. Another 
type of proof has been developed by Coral. See this Journal, vol. 3(1937), pp. 585-592. 

2 A proof of the Weierstrass condition in the calculus of variations, American Mathematica! 
Monthly, vol. 41(1934), pp. 502-504. 

‘In the cases n = 1 and n = m — 1 (with more general hypotheses on the class of ad- 
missible manifolds) McShane gave a direct proof of the transformed condition for para- 
metric problems. See Annals of Mathematics, vol. 32(1931), pp. 723-733. 
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P.. — px = F,G', where FF, > 0, G'G' > 0. Then there exists a matrix 
(B,.) such that 
| = 
(7) (y=1,---,n—lja=1,---,n) 


isnon-singular. Introduce the new variables u, = Bya(ta — fa), v = Falta — ba). 
Let n(u, 6) be a function which vanishes for uju, = b°, is positive for uju, <b’, 
and is such that » and dn/du, approach zero with b. For example, we may take 


i 


n= (1 — uyu,)’ — (1 — Bb)’. Let C' = (a9'/av)'~* + G’, and set 
(1) X‘(u, v) = ¢'(u, 0) + C'v. 
Then at ¢ = 7? we find 
ax' ag’ | 
2 = 9+ FG = Pi. 
(2) Ota Ota + 


Suppose 0 < « < 1, and for u,u, < 6 set 


, [¢'(u, ») — X'(u, en)] 
n— , 


w'(u,v;b) = ¢'(u,n) + ww — 4 


By use of (1) and the assumption that dn/du, tends to zero with b, we find that 
if the partial derivatives of w' are evaluated for u,u, < b°,0 < v < 7», then as b 


Boo’ " 
OUy Ou, ; 
ry an ag'\'~! “a 
— — ef = _ a. 
Ov l—e ov ov l—e 


Consequently 


Dew" ag;\"~* P — € ‘ 
3 coe el - F.G' = p, — Ap. 
8) Ata (**) i=" "fi 


tends to zero, 


Now let R; denote the region of t-space determined by the inequalities uju, < 
b°. 0 <v < en, and let R2 denote the region determined by u,u, < b°, en <v < 79. 
If W(b) is the n-dimensional “‘volume’”’ of R; + R:, then eW(b) is the volume of 
R, and (1 — ¢«)W(b) is the volume of R,. Let the manifold M, be defined as 
follows: 


ve on &,, 
x = <w'(t; b) on Re, 
lop elsewhere. 


Then M, coincides with the minimizing manifold Mo except over the region 
R, + Re and is an admissible manifold when b is sufficiently small, except possibly 
when the portion of Mo over R, + Rez has points in common with the boundary 
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of the region S where admissible manifolds must lie. Let us assume that the 
boundary D of the region S is a manifold of class C’. Then in case M, turns out 
to lie on the wrong side of D, a change of the signs of the F. and G’ leads to a 
manifold M, on the right side of D, except possibly when the manifold x‘ = X‘(t) 
is tangent to D. In this exceptional case a slight modification of the matrix P 
leads to a manifold z' = X‘(t) which is not tangent to D, and then considerations 
of continuity show that the desired result still holds. Thus, since our method 
of proof makes no use of the vanishing of the first variation, it is applicable to the 
case of “unilateral variations’’. 
Since 7(M>) is a minimum we have 


I(M,) — I(Mo) € / » 
te ; = i 

0<s Wo) Wo) te X, X,) dt 

We 

(1 — ¢)W(b) Je, 

If we apply the mean value theorem to each of the three integrals and let b tend 

to zero, we find by (2) and (3) 


i 
+ flt, w, o) dt — gro fe 0, oda 


(a f(P) + (1 = of(p - ; £47) - f0) 2 0, 


where for convenience the arguments / and = ¢(@) of the function f have been 
omitted. In deriving the necessary condition (4) no use has been made of the 
existence of the partial derivatives f7. When f has a total differential with 
respect to the arguments p, , we may divide (4) by ¢ and let ¢ approach zero to 
obtain 


(5) f(P) — fp) — (Pt. — pe) fi(p) 2 0. 


2. Transformation of the Weierstrass condition for the parametric problem. 
When the variables ¢, are regarded as parameters and only the x‘ are considered 
as coérdinates, we usually wish the integral J to be independent of parameter 
transformations with positive Jacobian determinant. A necessary and sufficient 
condition for this is that the integrand f(t, z, p) be independent of the ¢, , and be 
equal to a function g(z, j) of the codrdinates x‘ and the n-rowed minors j, of the 
matrix p = (p) which is positively homogeneous of the first degree in j7. We 
assume that n < m and that on admissible manifolds the matrix p has rank n 
at every point, so that not all of the j, are zero. If, for each a, (p., ---, Da) 
are regarded as homogeneous coérdinates of a hyperplane in (m — 1)-dimensional 
projective space, then the 7, may be regarded as the homogeneous codrdinates of 
an (m — n — 1)-dimensional linear manifold in this space. For a full discussion 
from the geometric point of view of the relations holding between the j, , the 
reader may refer to Bertini’s Geometria proiettiva degli iperspazi, pp. 33-39. 
The discussion given below is in a form better suited to our purpose, and is 
perhaps briefer. 
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In each matrix such as (p’,) and (kj) we shall understand for definiteness that 
the subscript is the row-index and the superscript is the column-index. We 
recall that all the solutions of the equations 


(6) Paw; = 0 


may be expressed in terms of the n-rowed minors j, of the matrix p. The index p 
may be regarded as a symbol for a combination of n elements chosen from the 
set » = (1, --- , m), and the index 7 as a symbol for a combination of n + 1 
such elements. Each column of the matrix k = (k;) is to be a solution of equa- 
tions (6) having (m — n — 1) zero elements, the remaining elements being minors 
jp formed from the (n + 1) columns of p corresponding to the elements of the 
combination 7, with proper signs attached. The matrix k is to include all such 
columns of solutions, so that it has m!/[(m —n+1)!(n+1)!] columns. A definite 
law may be set down determining the order of the columns of p occurring in 
each minor j, and determining the signs chosen for the j, in each column of k. 
We suppose such a law chosen, so that the vector j and the matrix k are uniquely 
determined when p is given. Under a transformation of the parameters t. 
whose Jacobian matrix is denoted by A, the matrix p is replaced by P = Ap, 
and the new matrix K corresponding to P is obtained from k by multiplying the 
elements of the latter by the determinant of A. Thus in particular p is not 
completely determined by k. 

The rank of the matrix k is always exactly (m — n). For it cannot be more, 
since the rank of p is n and the columns of k are solutions of equations (6). And 
it cannot be less, since every solution of (6) is expressible in terms of the columns 
of k. This statement about the rank of the matrix k yields the quadratic rela- 
tionships which hold between the minors j, . 

We next wish to show that if a vector J = (J,) is given such that the corresponding 
matrix K = (Kj) has rank (m — n), then there exists a corresponding matrix 
P = (P4,) of which the J, are the n-rowed minors. It is plain that there exists a 
matrix p of rank n such that pA; = 0. For definiteness of notation suppose 
the matrix K arranged in blocks in the form 


Ki Ks 
Ko Ky, ‘ 
where K; is a square non-singular minor of m — n rows, having zeros off the main 


diagonal, and diagonal elements equal to + J,,. Then p has the form (p; , p2), 
where ps has n columns and is non-singular. The block k; of the matrix 


— ky ks 
= ; 7 


formed from the matrix p is obviously a multiple cK; of K,. Every column of k 
is a linear combination of the columns of 


(7) (=). 
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so that ke = cK,. Wenext proceed to show that the elements of K; and hence of 
K, are determined by the elements of K, and K,. Recall that K, has elements 
+J,, # 0 on the diagonal and zeros elsewhere. The row index 7 for K,; ranges 
over the set u — po. The elements K; of Ke are all the J, such that ¢ has n — 1 
elements in common with po. If now 7 contains just two elements of un — py 
and 7 is in wu — po, K; is either zero or +J, , where o contains n — 1 elements of 
po , and so K; is already determined. The remaining elements of these columns 
are then also determined since every column is a linear combination of the 
columns of (7). The elements so far determined include all the J, such that 
has at least n — 2 elements in common with po. Proceeding thus, we see that 
all the elements of K are determined by those of its minor (7). Hence k = cK, 
and the desired matrix P may be obtained from p by dividing a row of the 
latter by c. Obviously P is determined up to a transformation of the form 
P’ = AP, where A has determinant unity. 

We note next that if the matrix P — p has rank one, a transformation A of 
determinant unity may be applied so that P = p except in one row. Then 
n — 1 rows of p are orthogonal to the columns of both k and K, where k cor- 
responds to p and K to P, so that the rank of the matrix (k, K) is not greater 
than m — n+ 1. On the other hand, suppose the matrices k and K are given, 
each of rank m — n, and that (k, K) has rank at most m — n+ 1. Then we 
may select the matrices p and P corresponding to k and K, respectively, so that 
they differ at most in the first row, and then obviously P — p has rank one at 
most. 

In terms of matrices p and P so selected the Weierstrass condition of §1 yields 
the inequality 

S(P) — fp) — (Pi — pifi(p) = 0. 


When this is expressed in terms of the function g(z, 7), it becomes 
— _ ... ae or > 
gJ) — g(7) — (Pi — pig, ~* 2 9, 
Opi 


where g, = dg/aj,. .But j, = pidj,/ap:, J, = P\aj,/ap;. Thus for para- 
metric problems the Weierstrass condition may be stated in the following form: 


E(z, j, J) = g(x, J) —_ g(x, 9) = (J, = JoGo(X, J) = 0 


for every element (x, j) of the minimizing manifold and for every vector J such that 
the corresponding matrix K has rank m — n and (k, K) has rank not greater than 
ma—n + I. 


THe UNIveRsity oF CHIcaGco. 
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COMPLETELY MONOTONE FUNCTIONS AND SEQUENCES 


By Witty FELLER 


1. Introduction. Consider a function f(x) defined by the Laplace-Stieltjes 
integral 


@ 


(1) fle) = l e* dF(t), 


where F(t) is never decreasing and is bounded in every finite interval and the 
integral converges, say, for xz > 0. In this paper we propose to deduce a simple 
inversion formula for (1) by an argument which enables us at the same time to 
prove in a very natural way some theorems of the theory of Laplace-Stieltjes 
integrals. Thus our argument provides an extremely simple proof of the well- 
known theorem to the effect that a function f(z) can be represented in the form 
(1) if, and only if, it is completely monotonic, i.e., if it has for z > 0 derivatives 
of any finite order such that 


(2) (-—1)’f"'(z) 2 0 (n = 0,1,---). 
A theorem which is substantially equivalent to this statement is proved by 
F. Hausdorff.’ In its present form it was first formulated by S. Bernstein,” and 
subsequently independently by D. V. Widder.’ A simplified proof was then 
given by J. D. Tamarkin,* and subsequently Widder himself proposed an alter- 
native proof.” Widder has also proposed some inversion formulas’ for integrals 
of type (1), but the formula given in the sequel seems nevertheless to be of 
some interest and proves the theorem of Hausdorff-Bernstein in a more direct 
way. Moreover, the method is easily applicable to other problems, in particular 
to the problems treated by Widder in his papers referred to above. As an 
example we deduce a necessary and sufficient condition that a function f(x) be 
representable in the form (1), when F(¢) is only supposed to be of bounded 
variation in every finite interval; this condition is equivalent to a similar condi- 
tion obtained by Widder.’ Also the case of a function F(¢) with a bounded 
derivative will be treated in the sequel. 

We then deal with the general interpolation problem for completely mono- 


Received April 1, 1939. 

‘Hausdorff [4], II, Theorem 3. (Numbers in brackets refer to the list of references 
at the end of the paper.) 

? Bernstein [1]. 

® Widder [10]. 

*Tamarkin [9]. 

® Widder [11], Theorems 17-18. 

® Widder [11]. 

? Widder [10], Theorem 12. 
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tonic functions. Let {z,} (n = 0, 1, --- ) denote an increasing sequence of 
non-negative numbers tending to infinity, and let to every z, correspond a 
number a, = 0. It is moreover supposed that the series }> 1/z, is divergent. 
Then a necessary and sufficient condition that there is a function f(x), com- 
pletely monotonic for x > 29 and such that f(z,) = a, (n = 1, 2, --- ) is that 
the divided differences of the sequence {a,}, as defined by (24), be non-negative. 
In that case f(x) is shown to be uniquely determined and representable by 
Newton’s interpolation series. For n = 0 we always have f(z) S a, but in 
general equality does not hold. The argument in this section will rest on quite 
elementary properties of the interpolation series. We notice that this inter- 
polation problem has already been treated by Hausdorff in connection with 
his investigations of the moment problem for 0 S xz S 1. Subsequently 
Widder’ has by another method treated in detail the special case of equal 
intervals, 2, = Nn. 

Finally, we derive an inversion formula which determines the function F(¢) 
of (1) directly in terms of an arbitrary sequence of values f(x,), provided only 
that x, tends to infinity and the series a 1/zx, diverges. This formula is a 
counterpart of the inversion formula mentioned above, the only difference being 
that Taylor’s formula is replaced by Newton’s interpolation series. In the 
special case of equal intervals, z, = n, related inversion formulas have been 
found by Hausdorff and Widder,” but our formula uses other differences and 
reduces to neither of theirs. 

Incidentally, it may be pointed out that the general inversion formula for 
unequal intervals of the argument is also of great importance for the stochastical 
theory of telephone traffic. In fact, this problem was proposed to the author 
by Conny Palm, of the telephone-administration of Stockholm, who was con- 
fronted with this and similar problems in the course of his important practical 
investigations on the telephone traffic. It seems that our inversion formulas are, 
with a slight modification, very suitable for numerical computations." 


2. Theorem of Bernstein-Widder and inversion formula. We proceed to 
prove 


THEOREM 1. A necessary and sufficient condition that the function f(x) can be 
represented, for x > 0, in the form (1), with F(t) non-decreasing and bounded in 
every finite interval, is that f(x) have for x > 0 finite derivatives of all orders satis- 
fying relation (2). In that case 


® Hausdorff [4], II. 

* Widder [10], pp. 880-886, and [11], part V. 

10 Hausdorff [5] and Widder [11], pp. 174-194. 

11 Dr. O. Lundberg has kindly drawn my attention to the fact that a proof of the theorem 
of Hausdorff-Bernstein on lines similar to those indicated in the present paper was recently 
sketched by Dubourdieu [3]. His starting point is the differential equations of a particular 
stochastic process related to the theory of sickness-insurance, and he arrives in the par- 
ticular case of a bounded F(t) at the inversion formula (3) of our text and the proposition 
referred to. For an alternative proof in this particular case cf. also footnote 13. 
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’ [ty] low )” i 
8) FQ) = tim Ye — 9" FG) 
no n=0 nN! 
in any continuity point of F(t). 

The first part of the theorem, as stated above, is proved by Hausdorff, S. 
Bernstein and D. V. Widder. We note that the restriction to the interval 
z > 0 means no loss of generality. For if the integral (1) converges for x > 2 > 
0 only, we get for fi(x) = f(x + x) and xz > 0 the representation 


(4) filz) = | “aR, with FQ) = [ €* aF(r), 
0 0 


and conversely. 

Proof of Theorem 1. The necessity of the condition (2) needs no comment. 
In order to prove the sufficiency we define a function F,(¢), depending on a pa- 
rameter » > 0, by F,(0) = 0 and 


[ta] 


(5) F,() = > =< f°) for t > 0. 


By (2) F,(¢) is for ¢ 2 O a never decreasing function. With this function we 
have, since f(x) is analytic for « > 0,” 


[ e *' dF, (t) _ > gw (—1)" 7G) 
0 nt 


n= 


wo 


=) nf] — 7 — 9)" fn) 


n=0 7! 


= f(n{l — &*"}). 
Thus 
(6) lim [ * &*aF(t) = f(a). 
no 40 


We shall now investigate the behavior of F,(t) for 7 ~ «. If f(0) is finite, 
it follows from (2) and (5) that F,(¢) s f(0) for all 7 > 0. Otherwise choose 
§ > 0 arbitrarily small and denote by a a number such that a > e’. Then for 
0 Sn S ty and 7 sufficiently large 


2-2 2 (1 « *" >a, 


and hence 


@ FO sab YF) (_ayy@ 5 a'f0). 


n=0 ! 
It follows that it is possible to pick out a sequence m— © such that F,,(t) con- 


" See, for example, S. Bernstein, Lecons sur les propriétés extrémales des fonctions ana- 
lytiques d’ une variable réelle, Paris, 1926, p. 190. 
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verges essentially to a never decreasing function F(¢), which is bounded in every 
finite interval: 


(8) lim F,,() = Fd 


in any continuity point of F(é). 
Next we prove that 


(9) lim I ear, = [ ear 
0 0 


ko 


forz > 0. Now for any T > 0 we have 


x n 
/ e“dF,) <¢ Dem" =< fase dX (=a , 
T+1 n>T, nm: n>Ty nm: 
and hence by the argument already used 


x 


/ e "dF, (t) < e er y (—ne 
T+1 


n=0 n! 


—t2/1) n 


f(a) 


=e ¥Tfig{l — e*}) s esl — e™), 


provided that 7 > 1. Consequently, given any e > 0 and 6 > 0, we can choose a 
T > O such that 


x 
/ e “dF,(t) <.« 
T 

forz >éandy > 1. Thus the integrals in (9) converge uniformly, so that (9) 
follows from (8). 

Comparing (9) with (6), we get for f(z) a representation of the desired type, 
and in order to complete the proof, it remains only to show that (8) holds for 
any sequence m tending with & to infinity. But this is almost obvious, for 
otherwise we should get two essentially different representations for f(x) in the 
form (1), and this is known to be impossible.” 


13 This is a simple consequence of the theorem of Weierstrass on approximation by 
polynomials; see, e.g., Hausdorff [5], pp. 222-223. At the present stage of our proof the 
correctness of the statement may also be seen directly in the following way. It is known 
that 


k 
, (kt)” 1 if t<1 
* | —kt _ ’ 
“ —_ p> r! {0 if t>1. 


This is Theorem 1 of Widder [11]. It may also easily be verified by evaluating e~*‘(kt)*/k! 
k oo 

by Stirling’s formula and majorating : > (kt)’/r! if ¢ > Lor bo (kt)’/r! if t < 1 by a geo- 
0 k+1 


metrical series. From (*) it follows that 


(tgz) (2t)" 
lim e~** =l if i<t<t. 


! 
= [eyzyt1 7 














ry 
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3. The function /'(t) of bounded variation. 


THEOREM 2. A necessary and sufficient condition that f(x) can, for x > 0, be 
represented in the form 


(10) f(z) = | e*'dV(t), 


with V(t) of bounded variation in every finite interval and the integral converging 
absolutely for x > 0, is that for any 6 > 0 there exists a constant M = M(6) such 
that 

x zr aa 6 n a) 
(11) y F= Pp) | < ME) 


n=0 nm: 


uniformly for x > 6. 
This condition is easily shown to be equivalent to a similar condition derived 
by Widder.” 
Proof. (i) Let f(x) be defined by (10) and suppose that 
(12) V(t) = Vilt) — Volt) 
is the canonical decomposition of V (¢) in two non-decreasing functions. Define 


fi(z) and fe(x) by 


@ 


(13) f(z) =| et aVi(t) G = 1,9) 
0 
and put 
(14) ois) « I etal Vit) + Vad}. 
Then, for any z > 6 > Oand any N > 0, 
0s >|" at ea) = [ ) — t"d{Vilt) + Vad) 

n=0 t: n=0 v3 

(15) : 
<[ e*“d{Vi + VO} = ¢0). 
0 


@o 

Now let f(x) = | e *dG(t), with G(t) never decreasing and bounded in every finite inter- 
0 

val. Then 


[tex] —r)" (r) r) te [tex] zt)" 
lim > (~ 2)" f'"( > lim / e = _ ( : dG(t) = G(t) — G(&) 
ve t Tr! 


“2 ! 
ze [tyr] +1 r: Grrr rhs {¢;2]+1 


for any pair (t; , é2) of continuity points of G(t). Hence G(t.) — G(ti) S F(t) — F(t), 
where F(t) is defined by (8), and this implies the uniqueness of the representation (1). 
In case of a bounded F(t) the argument yields another simple proof of the inversion formula 
(3) and the theorem of Hausdorff-Bernstein. 

“ Widder [10], Theorem 12. 
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But since 
e(z) = fi") +hr(x) and fi(a)f2"(z) 2 0, 
obviously 
s'(@) | = |" @) -— fr" @ | S| e"@) |. 
Thus (15) implies (11) with M = ¢(6). 


(ii) Conversely, let us suppose that (11) holds. Define two non-decreasing 
functions P,(t) and N,(t), depending on a parameter 7, by 


lt 
P,) = 7s 
nm! 
(16) [tn] _n 
NiO) = 27 f° @ | — (-—p"s'@)}, 


fC) |, 


wheret > 0. For P,(t) we may by an argument already used deduce an inequal- 
ity similar to (7), viz., 


(17) P,(t) < a'M(8), 


holding for any given 6 > 0, a satisfying the inequality a > e’, and 7 suffi- 
ciently large. Hence for a suitable sequence 7, tending to infinity the functions 
P,,(t) converge essentially to a never decreasing function P(t) which is bounded 
in any finite interval. Now by (16) the variation of N,(¢) in any finite interval 
does not exceed twice the corresponding variation of P,(t). Hence we may 
choose the sequence 7 in a way that also N,,(t) converges essentially to a non- 
decreasing function N(t). 

Repeating then the argument used for the proof of Theorem 1, we see that 
for 7 > 1 


[Otero s & mnt rm see YO yma 


n2=T n2=T 


< er SF iall - ae —7\" ) gn) < tela 
"ta > ~ (F'n) | S Ml — &*""}) 


n=0 


e *™ M(1 — e*). 


Hence it is seen that the integrals converge uniformly and 


@ 


lim | e*aP,,() = [ e* dP(t), 
ke 40 0 
and also, since the variation of N,(t) is dominated by that of 2P,(), 
lim | e*dN,,(t) = e* dN (0). 
0 0 


ke 
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We thus obtain 


@ 


[ e“atP@ - NO} = lim [ e'd{Px) — Nault} 


= lim oe ning (— <—J f"'(m) = lim of y meld —¢ | — »)*f"G). 


kw n=0 ko n= 
But j(z) is clearly an analytic function” so that the last series equals 
f(m{1 — e&*""}) 
which quantity tends to f(z). Hence 


@ 


(8) je) = [ &*ajP® - NO}, 
0 
and the proof is complete. 


4. The function F(t) with a bounded derivative. As a further application of 
the method we prove the following theorem of Widder:”* 


TEEOREM 3. A necessary and sufficient condition that f(x) can be expressed in 
the form 


(19) fie) = | &*o(t) a, 


with g(t) uniformly bounded, is that 


(20) Zs°@| <4 (n = 0,1, +++), 
n! x 
where A is a constant. 
Proof. (i) If f(x) is given by (19) and | ¢(z) | A, then 
x (n) a —s6 —- e zt (xt)" A 
nit ()) = [ . ! olf) dt oad n! onion s 


This proves the necessity of the condition. 
(ii) Conversely, it follows from (20) that for 0 < t < t& 


[t2n] 


(21) y ig fC) < A(t — hh). 


(tin) 7 n! 
Consider now the function P,(¢) defined by (16). By (21) its variation in any 
interval 0 < t; < te does not exceed A(t, — ¢,), and the same is naturally true 
also for the limiting function P(t). Thus the derivatives of P(t) are bounded 
by A, and hence 


t 
(22) P() = / gi(s)ds with 0 S ¢(s) S A. 


1 See footnote 12. 
1° Widder [10], Theorem 13. 
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But, as it has already been observed, the variation of N,(¢) in any interval does 
not exceed twice the corresponding variation of P,(t), so that also for N(t) a 
relation similar to (22) holds. In view of (18) these relations imply (19). 


5. The interpolation problem. Consider a sequence of real numbers 
(23) SBS Ce % «++ € HS «es, tn > © 


and let us suppose that to each z, corresponds a number a, 2 0. Then the 
divided differences of the sequence {a,} are formed by induction in the following 


MN 


manner 
[an] = an, 


(24) - (ai, , Gi, , es , a;,] ~_ [a;,, ai, , =~ a;,_,] 


[ai,, @i,, ***» Qi, ’ 


Vi, — Lig 
so that the order of a divided difference is less by unity than the number of 
arguments required for its definition. It should be remembered that the divided 
differences depend essentially on both sequences {z,} and {a,} even though the 
usual notation [a;, , --- , @;,] does not show it. We note further that the divided 
differences are symmetrical functions of their arguments. In case of equal 
intervals of the argument, say z, = n, we have 


» l n 1 - n—k {1 

(25) [ao, Qi, ++, Qn] = , 4 a = , dy (1) | a. 

n! n! k=0 k 
For the sequel it will, however, be necessary not to restrict ourselves to the 
divided differences [ay , x41 , --- , @n] of consecutive a; . 

The sequence {a,}, depending on the argument-values z,, will be called 
completely monotonic, if for any set of non-negative integers (%, ---, %) 
(n = 0, 1, ---) 

> n 
(26) (—1)"[ai, , ai, , +--+, @:,] 2 0. 


Consider now a sequence {a,} defined by the values taken by a completely 
monotonic function at the points z, : 


(27) an = f(rn) (n = 0,1, ---); (-1)"f°"(z) 20 for x =m. 


Then it is known that there is some value é between z;, and z;, such that 
ns 1 (n) 
[a;, , iia Sy » i] nal nit (é), 


and hence the sequence {a,} is completely monotonic. Before proving the 
converse theorem, let us observe that increasing a) makes of {a,} a new sequence 
which is still completely monotonic. On the other hand, it will be proved that 
decreasing a» in a sequence such as (27) makes of it a sequence which is no longer 


17 Cf., e.g., the treatise of Milne-Thomson [7]. 
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completely monotonic. Such sequences are called by Widder minimal se- 
1 vr . . . 
quences. With this preliminary remark we proceed to prove 
TueoreM 4. If the sequence {a,} corresponding to the argument values (23) is 
completely monotonic, and <f the series 


(28) Dd 1/2n 


diverges, then there is a uniquely determined function f(x), defined and completely 
monotonic for x = 2%, such that 


(29) S(2.) = a, (n = 1,2,---); 
for this function 
(30) f(z0) S a, 


and if io, 11, +--+ ts any strictly increasing sequence of integers such that D1, Xi, 
diverges, then for x = Xp 


(31) f(z) _ D> (—1)*[a;,, ai,, “+, dy) (zig —Z)- (v4, — 2). 


Proof. Let {in} be a strictly increasing sequence of positive integers and let 


(32) > 1/2, 


be divergent. Suppose that % 2 1, and let 7 be an integer such that 0 Si <<. 
Then by Newton’s interpolation formula (or by a straightforward inductive 
argument) 


(33) D> (—1)*[a,,, mae. ** "5 ai,] (xi, _ xi) atts (zi,_, _ x) 


= a;- (— 1)**"[a;, Qin» Vij, °°» diy] (zi, — Zi): (tix — i) 


for any N = 0. Now by (26) the sum on the left side contains non-negative 
terms only, while the right member does not exceed a;. Hence we may let 
N— and obtain 


(34) Da (—1)*[ai,, ai,, -*+, Gy) (tig — 2) +++ a, — He) S a. 
k=0 


But for any fixed x > 2x; and k sufficiently large 0 < 2, — x < 24, — 2;, 80 that 
(34) implies that the series 


(35) i, ee eee ee eee 


converges for allz 2 1%». 


'8 Widder [10], p. 880. 








670 WILLY FELLER 
Next we show that in case 1 S i < % 
(36) (—1)**"a;, Qi, , Gi, ,-** , Qiyl(Zi, — Zi) --+ (Liy — 2) 7 0 


as N — «. In fact, by (33) and (26) the quantity on the left side is never 
increasing and non-negative. Suppose now that 


(37) lim (—1)*"™ [a;, ai,, «++, Giy] (ti, — 2s) +++ (tiy — 2%) =a > 0. 


N--2x 
Then, for any k 2 0, we should have 
(—1)*"[a;, ai, @i,, ***, Gig) (ts — 20) (tig — 20) +++ (Xi, — 20) 


> (—1)*" [a;, ai, «++, Gy) (xi, — 2a) +++ (Zu, — 2) (tH, — 2) 


Xi, = Li 


But it was supposed that 7 2 1; hence it follows from (34) by a slight change of 


notation that 


> (—1)**"[a;, aig, @s,, ++» Gig) (i — to) (aig — 20) +++ (7%-, — To) S S(z0), 


while 
=, a(x; — Xo) 


k=0 Vi, — Di 


diverges by hypothesis (32). This clearly contradicts (38) so that (37) must be 
false. Thus (36) is proved. 
Comparing now (36) with (33), we see that for 1 <7 < % 


x 
(39) > (—1)*[ai,, «++, Gig] (aig — 2) +++ (ty-, — 2) = O. 
For i = 0 the inequality (34) cannot, of course, be improved, as the left member 
is independent of a while increasing a does not alter the completely monotonic 
character of the sequence {a,}. 

Consider now the function defined by the series (35). We propose to show 
that it in no way depends upon the particular choice of the %. To prove this 
we put for z 2 2% 


(40) fx) = > (—1)* as, 5 Ginny, °° * > Graal (te, — 2) +++ (2i,45-, — 2): 
k=0 
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It is easy to prove that all f,(x) are identical. In fact, for any r = 0 and N 2 0 
we have 
Xx 
k 
p> (- 1) lai,.,, Ginags °° *s Gi, se411 (2s, = x) ‘ibe (Zi +e = x) 


> (—1)*{[a;,, aie tah M.re00d ence “ie xi,) 


k=0 


(41) + [as,, ++, O6-44)} (ting, — 2) (tia, — 2) 
N+1 
— 2 (=I lai,» +++ diss ips — 2) oo+ (45,,,-, — 2) (ti,4, — Z,) 


+ p> (—1)*la,, 0+, Giga) (Zips, = %) eee (2i4. — x) 


N 
_ ks (—1)*fai,, stan. ai, ,,] (ri, —_ 2x) ica Gara = x) = Ry, 
k=0 
where 
Ry = (—1)*"[a;, eres » Bornes Sires wat x) se (Si,4n as Z)(Linan4s ions 2i,). 


But Ry — 0 for any fixed x = 2% and r; this follows immediately if in (36) we 
replace 7 by % and 2 by 2,424: (K = 0,1, ---), which is only a change of notation. 
Thus we may in (41) let N — ~, and it is seen that 


(42) Srsi(z) = f(z) = f(z). 


Now it follows from (39) that f(z;) = a; fort = 1,2,---,7%,—1. Butt, « 
as — o and thus we conclude from (42) that f(z;) = a; fori 2 1. Finally, 
f(to) S a by (34). 

In order to complete the proof it remains only to show that f(x) is completely 
monotonic and uniquely determined. But by writing f(x) in the form (40), 
we readily see that for x» S x S 2;, and any n 2 0 


(43) (—1)"f'"(z) 2 0, 


since all the factors z;, — x are non-negative and by hypothesis (26) holds. Now 
ris arbitrary and z;, — ©, so that (43) holds for any x 2 2. 

To prove the uniqueness of our interpolatory function we observe that (39) 
implies that two completely monotone sequences which coincide at the points 
of a sequence 2;, such that >> 1/z;, diverges differ at most by the first term. 
On the other hand, if f(x) is completely monotonic, and if {&} is an arbitrary 
sequence of points, then the sequence f(&) is also completely monotonic. Thus 
two completely monotonic functions which coincide at the points (23) are 
identical. This completes the proof of the theorem and we have also the 
following 


Coro.tiarRy. T'wo completely monotone functions which coincide at the points 
of a sequence x, such that y 1/x, diverges are identical. 
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By Theorem 2 this may be expressed also in the following way. If 


a® 


f(z) =| e dV (t), 


where V(t) is of bounded variation in every finite interval and the integral 
converges absolutely for z > xo, and if z, are the zeros of f(x), then the series 
> > 1/x, converges. This theorem has been proved by Wintner,” who has pointed 
out that it is a consequence of a theorem of Miintz” to the effect that an arbi- 
trarily good approximation of any continuous function in 0 S xz S 1 may be 
performed by linear aggregates of the functions x** (x, 2 0) provided that 
>> 1/2, is divergent. Conversely, the possibility of such an approximation is a 
consequence of the uniqueness theorem, as has been proved here by a direct 
interpolation. 


6. Inversion formula for sequences. Given a completely monotonic sequence 
la,}, by (31) we are able to calculate the corresponding interpolatory function 
f(x), and then by (3) also the function F(t) to which it belongs. It has, however, 
been pointed out in the introduction that practical problems require a direct 
determination of F(t) in terms of a,. For the special case of equal intervals, 
zZ, = n, such inversion formulas have been derived by Hausdorff and Widder, 
but they are insufficient for our actual purposes. 

Without loss of generality we may assume that 7% = 0. We then have 

THeoreM 5. Let0 = x <2, < --- <2,— & bea sequence of points such that 
> 1/x, diverges but : 8 1/z%, converges. Let a completely monotonic sequence of 
numbers a, 2 0 correspond to the points x,. Then the conipletely monotonic 
function f(x), which at the points x, (n = 1, 2, ---) takes on the values a, , may be 
represented in the form (1) with 

en (t) 


(44) F(t) = lim ; > (—1)*[aw, wai, +++, Ong] Dw Owgs +++ Lwye-1, 


N--0o k=O 


where gy(t) denotes the integer determined by the relations 


en (t)+N—1 1 en (t)+N 1 
(45) 2, =—<6 —2t, gn(0) = -1. 
k=N Tk k=N Tk 


Proof. Define for any integer N > 0 and fort 2 0 a function F' y(t) by 


en(t) 


(46) Fu(t) = >> (—1)"law, auys, +++, Guse] Dw nga ++ Dugas. 


k=) 


19 Wintner [12]. 
2° Miintz [8] or, e.g., the treatise of Kaczmarz-Steinhaus [6]. 
2! The supposition that y > 1/z, is convergent means obviously no restriction. 








te 
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Since the sequence {a,} is supposed to be completely monotonic, it is seen by 
(26) that F’y(t) is a never decreasing function of t. Taking in (34) 2; = x = 0, 
we see further that for any N > 0 

F y(t) < aH. 
Hence it is possible to pick out a subsequence {N,} such that the sequence 
‘Fy (t)} converges essentially to a never decreasing function F(t). Since the 
functions F'y(t) are uniformly bounded, we have for z 2 0 


(47) lim | e *dFy,(t) =| e dF(t). 
rx 0 0 
Now 


| e*' dF y(t) 


20 N+k—1 1 . 
= > exp (-z b> ) (— 1) lay, GQn+t, ***, Qnakltn2w41 °** DN+k-1- 


k=0 v=N y 


. 2 
But sinee >> 1/z; converges we have 


N+k-1 ] N+k-—1 x 20 1 
exp (~2 > ) = II (1 _ )+ o(& *). 
yv=n Ty v=N ry yv=N I, 


Thus we obtain by (31) 


ae 


| e dF y(t) 
0 


ew f(0) 
+ > (- 1)"[aw, Gnu, °**, aw +x) (aw ane x) a (tN4k—-1 of x) 


= ev (0) + f(z), 


with ey tending to zero as N— «. Hence by (47) 


a 


x 
| e dF(t) = lim | e ‘dF y,(t) = f(z), 
0 r—« “0 
and this is a representation of the desired type. Now the representation (1) 
was seen to be unique, and it is therefore impossible to pick out two subsequences 
\Fy(t)} with different limiting functions. This proves the correctness of (44) 
and completes the proof of the theorem. 

Remark. Obviously we are free to use instead of gy(t) any integer-valued 
function which as N — ~ is asymptotically equivalent to it. Thus we may in 
the case of equal interval, z, = nh, write (46) in the form 


[N(e*#—1)] r " 
(48) Fy) Dd (-o'(% “; i ') tay. 
k=0 


Replacing here N by a new parameter nh, we readily see that as h — 0 F y(t) 
tends to the function defined by (3). 
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In order to compare (48) with the corresponding functions of Hausdorff and 
Widder we make the substitution z = e ‘, thus getting a non-decreasing function 
in 0 < zs 1. Moreover, we have to take h = 1. Then Hausdorff’s N-th 


approximating function is a step function with N jumps of amount (—1)* (2 


A‘ ‘a, at the points k/N (k = 0, 1,---,N). Widder’s N-th approximating 

function has, as well as ours, infinitely many jumps at the points N/(N + k) 

(k = 0, 1,---). But the amounts of the corresponding jumps differ: with 

N+k-1 
N 


a%ay , and thus differences of a fixed order 
is ({N+k—-1 
are used. With our formula the jumps are of amount (—1)‘ ( + k A‘ay, 


Widder they are (— »"( 


so that differences belonging to a fixed point are used, i.e., the differences enter- 
ing also in Newton’s interpolation series. 
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ISOPERIMETRIC PROBLEMS OF BOLZA IN NON-PARAMETRIC FORM 
By Wiuu1AM T. ReEIp 


1. Introduction. For isoperimetric problems of the calculus of variations in 
parametric form an elegant proof of sufficient conditions for a strong relative 
minimum is afforded by the use of the so-called Lindeberg theorem. This result 
was obtained by Lindeberg [8]' in 1909, who at that time applied it to the plane 
isoperimetric problem in parametric form.’ Subsequently, this theorem was 
generalized and extended by Levi [7] and Tonelli ({13]; [14], vol. 1, p. 321). 
Recently Perlin [9] has developed a generalized Lindeberg theorem and applied 
it to the study of sufficient conditions for the parametric problem of Lagrange 
involving isoperimetric side conditions. 

For non-parametric isoperimetric problems of the calculus of variations, how- 
ever, neither the result obtained by Lindeberg [8], nor any one of the extended 
forms of the Lindeberg theorem established by Levi [7] and Tonelli ([13]; [14], 
vol. 1, p. 422), is effective in the proof of sufficient conditions for a strong rela- 
tive minimum. Recently, Hestenes [5] has given a sufficiency proof for the 
isoperimetric problem of Bolza in non-parametric form. His proof is a gen- 
eralization of the usual field method and does not use any analogue of the Linde- 
berg theorem; in particular, it involves the breaking up of the given extremal 
into suitable subares. 

It is the purpose of the present paper to derive an effective Lindeberg theorem 
for non-parametric problems of the calculus of variations with isoperimetric 
side conditions. For the sake of generality, we consider specifically a problem 
of Bolza with variable end-points. It is of interest to note that the analogue of 
the Lindeberg theorem presented in §4 involves only the Weierstrass &-function 
associated with the problem under discussion. It is first proved in §3 that if E 
is a non-singular extremal for the given problem satisfying the Weierstrass 
condition IIy , then there exists an associated problem which is entirely equiva- 
lent to the initial problem, and for which new and stronger forms of the usual 
Clebsch and Weierstrass conditions hold. The forms of these conditions thus 
derived render a simplicity to the results of §4 comparable to that for a problem 
which involves no auxiliary differential equations. They also enable one to 
simplify the expansion proof of sufficient conditions for the non-parametric 
problem of Bolza given by the author ({10], [11]). 


Received April 24, 1939; presented to the American Mathematical Society, December 28 
1938. 

' Numbers in brackets refer to the bibliography at the end of this paper. 

* See also Bolza [2], pp. 515-518. 
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Finally, in §6 there is established an Osgood theorem for the calculus of 
variations problem herein considered. 


2. Formulation of the problem. The problem to be considered is that of 
finding in a class of ares 


(2.1) yi(z) @=1,---,mmSrsm) 


satisfying the auxiliary conditions 


(2.2) dal, y, y’] = 0 (a =1,---,m <n), 
(2.3) Vl » y(x1), Ze, y(22)] = 0 (u - 1, rae »P Ss 2n + 2), 
(2.4) Xelti , (21), Ze, y(X2)] + / hz, y, y']}dx = 0 (s = 1,---,9) 


one which minimizes a given functional 
z2 

(2.5) J= gla ’ y(21), Z2, y (22) + / Slz, Y; y’) dz. 
Z1 


This is a problem of the general type of Bolza in the calculus of variations 
involving the isoperimetric conditions (2.4). For simplicity, this problem will 
be referred to as B. 

It is supposed that there is given an open region F of points [z, y, r] in which 
the functions fz, y, r], dal, y, 7), haz, y, 7] are of class C*. We shall also suppose 
that R, is an open region of (2n + 2)-dimensional sets [xz , yi , Ze , Yi2] in which 
the functions g, ¥,, x. are of class C’ and, moreover, the matrix (Wyz, Vay 
Wury Vuy;.) is of rank p. An are (2.1) will be said to be admissible for problem B 
if its defining functions are continuous, possess piecewise continuous derivatives 
of the first order, its elements [z, y(x), y’(x)] are in R, its end-points [x , yi(a), 
Ze, yi(Z2)| lie in R,, and for this are conditions (2.2), (2.3), and (2.4) hold. 

It is readily seen that the above problem B is formally equivalent to a problem 
of Bolza of the usual sort involving n + q dependent functions. For if we set 


(2.6) nis) = / hilt, y(t), y’()] at @ «3, 


the above problem becomes that of finding in a class of ares 


(2.1’) yi(x), u,(x) G@=1,---,nms=1,---,Qnszrzsn 


satisfying 
(2.2’) dalz, y, y’] = 0, u,t+h{z,y,y’'])=0 (a=1,---,m;s=1,---,9), 
Vulzi , y(21), 22, y(Z2)] = O (u = 1,--- 5p) 


(2.3’) 
Xelti, y(Z1), 22, y(Z2)) + u(r) = 0, wlre) =O (8 =1,---,9 
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one which minimizes the functional (2.5). This latter problem will be referred 
toas B. An are (2.1’) will be said to be admissible for 8 if and only if it is 
related by equations (2.6) to an admissible are for B. 

Suppose that y;(x), u.(x) is an admissible are for 8 which is without corners, 
and which satisfies with multipliers \y) = 1, A(z) (« = 1,---,m + q) and 
constants e,, ¢, the multiplier rule, the Weierstrass condition Ily , the non- 
singularity condition, and IV%. Then ©: y;(x), u.(x), Xo = 1, Ax(x) is an ex- 
tremal for 8. In particular, the multiplier rule implies that A,4,(2) (s = 
1,---,@) are constant on 2,22 and that €, = Xmas. Corresponding to & we 
denote by E the set y;(x), Xo = 1, A(z) (a1 S F S 2); such a set will be 
termed an extremal for B. If we set 


F[z, Y, r; \] = Aof [z, y; r] + AaPalZ, y; r] + Am+sh.[2, y, r), 
the multiplier rule implies that along EF 


qF,, 5» _ es 
dr F,, wae 0, Pa — 0, 
and the relation 

((F — yiF,,) dx + F,, dydli + dg + db, + Amis dx, = 0 


holds for every choice of the differential dz, , dy , dzz, dyig. Since problem B 
does not involve the variables u, explicitly and the differential equations (2.2’) 
contain the u, only linearly, it readily follows that Ily is equivalent to the 
condition that there exist a neighborhood N of the elements of £ in [z, y, r, AJ- 
space such that if [z, y, r, A] is in N and ¢,[z, y, r] = 0, dalz, y, 7] = 0, then 


(2.7) S[z, y, r, A; 7] = Fiz, y, 7, A] — Fiz, y, r, A] — (*% — rdF,,[z, y, r, A] 


is non-negative. The non-singularity condition for 8 is reducible to the non- 
singularity of the matrix 


Fy; Per; 
Par; Oas 


along the elements of EZ. Finally, IV* may be written as the condition that 
along E the second variation 


(2.8) Jo = 2y[f:, (21), & , (x2)] + [ 2w[x, n, n'] dx 


be positive for all non-identically vanishing sets [& , & , 9i(x)] satisfying along F 
the equations of variation 


®,([z, ; n'| = 0, Wile , n(21), &, n(22)] = 0, 


X,fé: » (at), &, n(x2)] + / (her, + hoy; ni) dx = 0. 
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In (2.8) 
Qo = Frrning + Wrevjniny + Pov min 


and 2y is a quadratic form in its arguments whose explicit form will not be 
given.” 

It is well known that conditions IIly and non-singularity imply the strength- 
ened Clebsch condition III’; that is, at the elements [z, y(x), y’(x), A(z)] 
(2; S x S 22) of E the quadratic form F,,,.,7,;7; is positive for all sets (4;) # (0,) 
satisfying ¢a,,7; = 0 (a = 1,---,m). It has also been pointed out recently 
by Hestenes and the author (see [6]) that conditions IIy and non-singularity 
imply the existence of a neighborhood of the elements of £ in [z, y, r, A]-space 
such that for [z, y, r, A] in this neighborhood, ¢,[z, y, r] = 0, dalz, y, 7] = 0, 
and (7;) * (r,), the &-function of (2.7) is positive. This neighborhood in 
general may be smaller than N, but for simplicity we shall suppose that N refers 
to such a restricted neighborhood; this strengthened condition will be denoted, as 
usual, by IIy . 

If C: Y,(z) (X, S x S X¢) is admissible for the problem B, we shall denote 
by € the set 


Xe 
6: Y,(z), U.(z) = / hilt, ¥(), ¥) dt (X, <2 < Xp, 


which defines an admissible arc for the problem 8. The usual sufficiency 
theorem for the problem of Bolza gives the following result: 


THEOREM 2.1. Suppose that an admissible arc (2.1) for B is without corners, 
and E: yx), » = 1, (x) (41 S X S 22) satisfies with constants e, the multiplier 
rule, Ily , non-singularity, and 1V%. If © denotes the corresponding set ©: y;(z), 
u,(x), Xo = 1, Ax(x), then there exists a neighborhood § of € in xyu-space, and a 
neighborhood IN of the ends of © in [x1 , yu, Un, Te, Yi2, Usel-space such that 
J{C] > J[G] for every admissible arc € in § with ends in M and not identical 
with ©. 

The central purpose of this paper is to prove the following stronger theorem. 


THEoREM 2.2. Under the hypotheses of Theorem 2.1 there exist a neighborhood 
‘¢ of E in xy-space and a neighborhood 9M of the ends of E in |x, ya, 22, Yaak 
space such that J{C| > J[E] for every admissible arc C in ‘f with ends in OM and 
not identical with E. 


The passage from the result of Theorem 2.1 to that of Theorem 2.2 will be 
effected by the use of the Lindeberg theorem of §4. Preliminary to the deriva- 
tion of this latter result we shall, however, discuss in the following section 4 
problem B* which is entirely equivalent to B, and for which stronger forms of 
the Clebsch and Weierstrass condition are satisfied. 


’ For a discussion of the second variation, as well as its explicit form, the reader is 
referred to Bliss [1], pp. 68-71; see also Hestenes [3], p. 797, or Reid [10], p. 665. 





it be 


igth- 
A (z)| 

(0;) 
ntly 
arity 
pace 
d in 
efers 


d, as 


note 


ISOPERIMETRIC PROBLEMS OF BOLZA IN NON-PARAMETRIC FORM 679 


3. An associated minimum problem. Let 
f*lz, y, vr] = Six, y, r] + 2U(z)oalz, y, rioalz, y, r], 


where I(x) is a given function of class C* on 2,22 , and denote by B* the problem 
in which the expressions ¢a , ¥ , Xs, 9, 4s are as in B, but f is replaced by f*. 
Clearly an are (2.1) affords a minimum for B if and only if it affords a minimum 
for B*. If 


F*[x, y, 7, X] = rof*[x, y, 7] + Aadalz, y, 7] + Amrehelx, y, 7], 


the functions F and F*, together with their first order partial derivatives, are 
identical at a set for which ¢,[z, y, r] = 0. Hence E:y;(x), X = 1, Ad(zx) 
satisfies with constants e, the multiplier rule for B if and only if £ satisfies 
with the same constants e, the multiplier rule for B*. Along £, Paw = 
Fy ir, + Ux)barar; ; moreover, if J? denotes the second variation along E for B, 
then 


at on te / Ux)e[z, «, o' Welz, 9, ol dz. 


Consequently, each of the following conditions is satisfied for both the problems 
B and B* whenever it is satisfied for one of them: II, Ily , Il) , non-singularity, 
III, II’, 1V*, IVx." 

It will now be proved that whenever £ is an extremal for B satisfying ILI’ 
and Il, , the function l(z) may be so chosen that the corresponding problem 
B* satisfies certain stronger forms of these conditions. 


THEOREM 3.1. If E is an extremal for B satisfying III’, there exists a constant 
| such that along E the corresponding quadratic form 


(3.1) FY. .mimy = (Priv, + U Gar Par;)®im} 
for B* is positive definite. 


Condition III’ is equivalent to the assumption that the quadratic form 
F,,,,ramj is positive for all sets (x;) # (0,) which give the quadratic form 
(Gar,ar,;)4im; the value zero. Since the last quadratic form is positive semi- 
definite, one may prove the above theorem in a direct and quite elementary 
fashion. This theorem is, however, a ready consequence of a theorem on quad- 
ratic forms proved by the author.’ For, as there proved, the determinant 
P(c) = | Fy; — obarar; | is & polynomial of degree m in ¢ and has only 
real roots. Moreover, since the form (@ar,¢ar;)™im; 18 positive semi-definite, it 
follows from the proof of Reid [12] that if —J(x) is less than the smallest zero 
of P(e) for the arguments [z, y(zx), y’(x), A(x)] of E, then (F,,-; + Ux)bar Par) it 


‘For the explicit definition and discussion of the conditions in this list that have not 
been defined above, the reader is referred to Bliss [1], Hestenes [3], or Reid [10]. 

5 See Reid [12]; also A. A. Albert, Bulletin of the American Mathematical Society, vol 
44(1938), pp. 250-253. 
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is a positive definite quadratic form. In view of the continuity of the coeffi- 
cients of P(c) as functions of z, there exists a constant | such that for 2, < 
x S 2X the form (3.1) is positive definite along Z. In other words, if E satisfies 
III’ for B there is a suitably chosen problem B* for which the strengthened 
Clebsch condition holds without the restriction ¢.,,7; = 0. This new condition 
will be denoted by III*. 

The Weierstrass &-function for the problem B* will be denoted by &*{(z, y, 
r, A; 7]. In the future we shall be concerned only with a problem B* for which 
the function /(z) is a constant value that satisfies the condition of Theorem 3.1, 
The following corollary is an immediate consequence of a simple continuity 
argument and an application of Taylor’s formula. 


Corouiary. If E is an extremal for B satisfying condition III*, there exists 
a positive constant r> and a neighborhood No of the elements of E in [x, y, r, d)- 
space such that if [x, y, r, d] and [z, y, 7, \] are in Mo, then® 


(3.2) &*[z, y, 7,4; 7] = ro ll|F — rll. 


THEOREM 3.2. If E: y:(x), 0 = 1, A(x) (a1 S X S 2) is an extremal for B 
satisfying conditions I1y , III’ and lI is chosen as in Theorem 3.1, there exists a 
neighborhood ° of the elements of E in [x, y, r, A]-space such that for [x, y, r, 
in N, [z, y, r] ¥ (x, y, 7], and ¢.[z, y, 7] = 0, we have &*[z, y, r, 4; 7] > 0. 


The conclusion of this theorem is a stronger condition than II}, since it does 
+ 


not require that ¢.[z, y, r] = 0. This latter condition will be denoted by Ig, 


The proof of this theorem is essentially the same as the proof of Lemma 43 
of Reid [10]. In view of the non-singularity of B, and hence of B*, there exists 
a neighborhood $i of the elements of £ in [z, y, r, A]-space and a constant é 
such that for [z, y, r, \] in 91 the system 

FF i{z,y,r +2,\ +2] — Fiz, y,r, 4) = 0 (@ =1,---,n), 
(3.3) 
galt, y, T + 2] = 0, Vmi+s = 0 (a =1,---,ms=1,---,@9 
has a unique solution z; = z,{z, y, Tr, A], va = valz, y, 7, A] satisfying 2:2; + vava <4. 
We may obviously restrict 9l so that for [z, y, r, \] in { the corresponding set 
[z, y, r + z, \ + »] is in the neighborhood N and also in the neighborhood % 
of the above corollary. Then for [z, y, r, A] in Mand 7 such that ¢,[z, y, 7] = 0, 
it follows from (3.3) that 


&*[z, y, 7, A; F] = S*[z,y, r + 2,4 + 57] + [x,y 7, A; 7 + 2]. 


Since [z, y, r + z, \ + v] isin N and ¢,|z, y, r + z] = 0, condition II}, implies 
&*[z, y, r +z, + v; 7] 2 0, the equality holding only if 7; = r; + z;. More 
over, as [z, y, r, A] and [z, y, r + z, A] are in No , we have &*[z, y,r, A;r + 2] 20, 
the equality holding only if z; = 0;. The conclusion of Theorem 3.2 is an 
immediate consequence of these results. 


‘If z= (z,, ---, ze), || z || is used to denote the positive square root of z? + --- + 2. 
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We shall denote by &,, the Weierstrass &-function for h, , that is, 
&:,[z, y, 7; 7] = Adz, y, 7] — hlz, y, r] — (Fi — riher[2, y, 7], 


and represent by &, the set (G,,) (s = 1,---,q). The following corollary to 
the above theorem will be used in the proof of the Lindeberg theorem of the 
next section. 

Corotuary. If E satisfies the hypotheses of Theorem 3.2, there exists a positive 
constant k and a neighborhood ‘Vy of the elements of E in [x, y, r, A|-space such 
that if [x, y, r, A] is in Dy and ¢,[z, y, 7] = 0, then 


(3.4) || Galz, y, 7; 7] || S kS*[z, y, 1, d; 7. 


For let 94, be a neighborhood of the elements of £ in [z, y, r, A]-space such 
that there is an associated constant d > 0 satisfying the condition that if 
[z, y, r, A] is in My and va, SF d, va = 0 (a = 1, --- , m), then [z, y, r, A + 7] 
is in the neighborhood $1 of Theorem 3.2. For [z, y, r, A] in 9 and 
¢alz, y, 7] = O we then have 


0 < &*[z,y,r,X + v3 7] = S*[z, y, 7, A; F] A vmyeGu,[z, y, 73 7, 
that is, 
| Vm+en (2, Y,7T; 7] | Ss &*[z, y,7T, A; | 


for all sets vms,. Satisfying ymisY¥mis Sd. It is then a consequence of Cauchy’s 
1 ) 


inequality that (3.4) holds for k = d™. 

Using the above results, one may readily prove for &* an inequality corre- 
sponding to that established for & by Reid ({10], Theorems 4.1 and 4.2). In- 
stead of the function R{t] there employed, we shall use here the function ®{¢] = 
(/(t + 1) for t = 0. In some respects this function is simpler than the R{#] 
previously used. In particular, it is easily seen that for ¢ = 0 


(3.5) Rit] < min (¢, #7) S WiZd- 


TueoreM 3.3. If E satisfies the hypotheses of Theorem 3.2, there exists a 
positive constant r and a neighborhood Sz of the elements of E in |x, y, r, \)-space 
such that for |x, y, r, A] in Dy and ¢,[x, y, 7] = 0 we have 


(3.6) &*[z, y, 7, A; F] S rR{|| F — r|\). 


Let 9g be a bounded neighborhood of the elements of E in [z, y, r, A]-space 
with the property that there exists a constant d > 0 such that if [z, y, r, A] 
isin My and || v || < 2d, then [x, y, r + v, A] is in the neighborhood Qo of the 
corollary to Theorem 3.1 and also in SW determined by Theorem 3.2. If 
[z, y, r, A] is in My and ||*# — r|| S 2d, it is then a consequence of (3.5) and 
the corollary to Theorem 3.1 that (3.6) holds for r = 1. 

Now suppose that [z, y, 7, A] is in Me, the set [z, y, 7] satisfies ¢.[z, y, 7] = 0, 
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=> 2d. Let r* = r + O(F — r), where @ is so chosen that 
; clearly 0 < 6 S 3. If 


Tx, y, 7, ; p] = F*{[z, y, 7, + (01 — ri Fra, y, 7, A, 
we have 
T[x, y, r*, A; r*] — T[z, y, r, A; r*] = &*[z, y, r, A; r*] > O, 
T[x, y, r*, A; r] — Tlz, y, r, A; r] = —G*[2z, y, r*, A; r] < 0. 


Consequently, there exists a value 7 = r + o(f — r), 0 < o < 86, such that 
T\x, y, r*, 4; 7] = Tz, y, r, A; 7], and this relation implies 


- : , ee CT 
(3.7) &*[z, y, 7, A; 7] = G*[z, y, r*, A; 7] + * Sz, y, r*, A571. 
o 


Since [z, y, r*, A] isin MN, &*[z, y, r*, A; 7] 2 O;asO0 << ao < 6 S 3, (1 — @)/o> 
(1 — 6)/@ = 1/(26) = (1/(2d))|\\*# — r||. Hence if 7 is such that 


&*[z, y, r*, A; r] 2 7 for all [z, y, r*, A] in M, || r* — r|| = d, it follows from 
(3.7) and property (3.5) that 


S*[z, y, 7, A; 7] > (71/(2d))R[\| F — rv ||]. 


It has been proved, therefore, that with 9% chosen as indicated above the 
inequality (3.6) holds for r = min (to, 7:/(2d)). The simplicity of this argu- 
ment in comparison to the proof of Theorem 4.2 of Reid [10] results from the 
above Theorem 3.2 for the suitably chosen problem B*. Since in case 
galz, y, Tr] = 0, dalz, y, 7] = O we have &*[z, y, r, 4; 7] = Sz, y, r, A; 7], the 
results of Theorems 4.1 and 4.2 of Reid [10] are included as a very special case 
of the above theorem. If B is an ordinary problem of Bolza not involving 
the isoperimetric conditions (2.4), and one applies the expansion method of 
proof to a suitably chosen problem B* instead of to the original problem B, 
the proof of Theorem 6.1 of Reid [10] is materially simplified. In particular, 
Theorem 4.3 of that paper is no longer needed, since an even stronger form of 
its conclusion is an immediate consequence of the above Theorem 3.3. 


4. A Lindeberg theorem. In the proof of the results of this section use is 
made of the following lemma. 


Lemma 4.1. Jf z(x) (¢ = 1,---,m) are absolutely continuous functions on 
X, S 2S X2and |\z\| S 60n this interval, then 


Xo Xe 
(4.1) / z{|-| 2’ ||dr sas [ Rl || 2’ |] dx + || 2(X,) ||’, 
x; xy 


7 For fixed values |z, y, A), ¢ = T(z, y, 7, 4; p| is the equation in (, ¢)-space of the tangent 
plane to the surface ¢ = F*[z, y, A, p| atp =r. The following proof is essentially the same 
as the geometric proof given by Tonelli ({14], vol. 1, pp. 351-353) of a corresponding in- 
equality for the plane problem of the calculus of variations. 
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Xe Xe 

(4.2) [ lz |? dx s a [ RI|| 2” ||] dx + || 2(X) ||’, 
1 #1 

where d; = 2 max (1, 26) max (1, X2 — X;) and dz = (X2 — X;) d,. 

This lemma is established in a manner analogous to the proof of Theorem 
5.1 of Reid [10]; use is made of relation (3.5) and the following properties of 
the positive convex function Rt] (¢ 2 0) here employed: 

P,: Ifa 2 0, amin (1, a)R[t] S Rat] S a max (1, a) Rit]; 

P,: If t; 2 0, te = 0, Ril + &] S 2(MRlA] + Rlé)). 

Property P; is an immediate consequence of the relation ®{at]/R{t] = 
a((t + 1)/(at + 1)]; Ps follows by the use of an elementary inequality which 
gives Rl + &] S 2 + &)/(4 + & + 1) S A/a + 1) + G/(e + I). 

Suppose E:y;(x), X = 1, A(x) (a1 S x S 22) is an extremal for B which 
satisfies Ily and the non-singularity condition. As pointed out in §2, these 
conditions imply III’ and II,. In view of the non-singularity condition there 
exists an €) > 0 such that for | X, — z,| S & (v = 1, 2) the elements of EZ are 
defined and of class C’ on X,X_. The corresponding extremal €: y;(x), u.(z), 
hy = 1, Ax(x) for B clearly has its elements also defined and of class C’ on X,X2. 
In the proof of an effective Lindeberg theorem for the problem B we shall use 
the following notation. An are C: Y,(x) (X; S x S Xz) will be said to lie 
in (E), if || Y(~) — y(x) || S € (X; S x S Xz) and | X, — z,| S e (v = 1, 2); 


similarly, an are ©: Y,(x), U.(x) (Xi: S x S Xz) will be said to lie in (©), if 

Y(rz) — y(x) || S «, || U(x) — u(z) || S € (Xi, S x S Xe) and | X, — 2,| S «. 
In the definitions of (#), and (€), nothing is prescribed concerning the con- 
tinuity and differentiability of the functions Y,(x), U,(x). It will be under- 


stood in the following that the values of « used do not exceed ©. 


THEOREM 4.1. Jf E:yi(x), Xo = 1, A(x) (41 S FX S 2) ts an extremal for B 
which satisfies conditions I1y and non-singularity, then corresponding to a given 
n > 0 there exist constants p > 0, € > O such that if C: Yi(x) (Xi S x S Xo) 


is an admissible arc for B which ts in (E), , while 


(43) [ &, Y@, v'@, X@); Yaz S o, 


then the corresponding arc ©: Y;(x), U.(z) (Xi S x S Xz) is in (©), . 

Let « > 0 be such that if C is in (£),,, then the set [z, Y(x), y’(x), A(x)] 
(X; S x S Xz.) is in NL defined by the corollary to Theorem 3.2, and also in 
the neighborhood N_ of Theorem 3.3. It is supposed, moreover, that « is such 
that if C isin (£),, then [z, Y(x), y’(x)] (Xi S x S N-) is interior to the region R; 
in particular, on the set of values [z, Y(z), y’(x)] the functions A, , together with 
their partial derivatives of the first three orders, are uniformly bounded with 
respect to all C: Y,(z) (Xi: S x S Xz) in (£),,. 
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Now 


Xo z2 
Use) — ula) = [he ¥, de — [ hle, y, y'laz 


Xe Xe 
~ [ &,,[z, Y, y’; Y’]dx + [ hz, y, y’] dx 
z z2 


(4.4) . 
+ | (hla, ¥,u/) — hele, vs y+ (PE = ydhorde, ¥, 9'D de 


Xe 
m | &s[z, Y, y's Y'] dx + lh + In, 


where 
Xe 
I, = / hz, y, y'}dx + (Yi — ydher[x, y, y’] (2? 
x2 
Xo 
+ / (h.[x, Y, y’] pa h,{z, y, y’| ita (Y; —_ yi) dhe, [x,y,y')/dz) dz, 


Xo 
I, = / (Yi — yi) (her,lx, Y, y’] — her ilz, y, y’)) dx. 


Clearly the norm of the set (U,(z) — u,(x)) (s = 1,--- ,q) satisfies the 
inequality 


X2 
(4.5) || U(x) — u(z) || s / lz, Y, y’; Y’] dx | + || Li || + || Ze II. 


If C is such that [x, Y(x)] is in (EZ), and 0 < « < «&, we have 
Xe Xe 
& lz, Y, y’; Y’]dzx|| s i \| Sala, Y, y’; ¥’) || dz 
xX) 
(4.6a) os 
< & [ &*[z, Y, y’, A; Y’] dz, 
Xi 


the last inequality being a consequence of the corollary to Theorem 3.2. Jf 
we use c as a generic constant, it is a consequence of the elements of C being 


in (EZ), that 
(4.6b) || I, \| S ce. 
Moreover, since for 0 < « < «& the neighborhood (£), is interior to Si, deter- 


mined by Theorem 3.3, one may prove by the use of the inequality (4.1) that 


Xe 
elise f iY —yll-ll¥’—y' lide 


s eds| [wl ¥’ — vide + | YOR) — Xd [P| 


x1 


Xe 
S cd, ! [ &*(z, Y, y’, A; Y'Jdx+ é]. 
T Jx 
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The constant d, appearing in (4.6c) depends upon e and Xz; — X,, but is ob- 
viously bounded since | X, — z,| S eandO <¢e<«. Inthe above inequalities 
explicit values of the constants c could be given in terms of the bounds of the 
functions A, and certain of their partial derivatives. 

It follows readily from the inequalities (4.6) that positive constants p, « may 
be so chosen that for an admissible arc C in (£), and satisfying (4.3) we have 
|| U(x) — u(x) || < 7 on X,X2, where 7 is any preassigned positive value. In 
particular, if such an ¢ is chosen not to exceed the given n, the are ©: Y;(z), 
U,(z), (Xi S x S Xz) is in (©), . 

THeoreM 4.2. If E:yi(x), > = 1, Ax(x) (a1 S X S 2) is an extremal for B 
which satisfies conditions Ily and non-singularity, and p is a given positive con- 
stant, then there exists an e’ > 0 such that J[C] — J[E] = 4p for every admissible 
arc C: Y,;(x) (Xi S x S Xe) whose elements are in (E),: and for which 


(4.7) [ ° &t2, Y@), y'@, M@); Y'@l az > v. 


Let € > 0 be such that if C: Y;(x) (Xi, S x S X2) is an are with elements 
in (Z)., , then the set [z, Y(x), y’(x), A(x)] is in the neighborhood 9, of Theorem 
3.3. It is supposed, moreover, that « is such that if C is in (£),,, then 
[z, Y(x), y’(x)] (Xi S x S Xz) is in the region R, and the set [X,, Yi(X)), 
X;, Y:(X2)] isin R,. In particular, the function F* and its partial derivatives 
of the first three orders are bounded on the set of values [z, Y(x), y’(x), A(x)] 
uniformly with respect to all ares C: Y;(z) (Xi S x S Xz) in (£),, ; similarly, 
the function g and its partial derivatives of the first two orders are bounded 
on the set of values [X; , Y;(X1), X2, Yi(X2)] uniformly with respect to such 
ares C, 

For an admissible are C in (Z). (0 < ¢’ S e&) we then have 


Xe z 
JC] — JB} = 4g + fl pitz, ¥, Yde — [ prte, y, y'lde 
Xi Z1 
Xe ze 
= Ag + / F*([z, Y, Y’, Ajdz — / F*[z, y, y'’, A] dx 
Xi 2 


Xe 
Ag + / &*[z, Y, y’, A; Y]dx + J’ + J’, 
x) 


deter- 


‘hat where 


z) Xe 
J [ F*(z, y, y’, \)dz + / F*(z, y, y’, dz + (¥; — y)F>lz, y, y’, MI |X? 
Xi ze 


Xe 
+ {F*[z, Y, y’, A] — F*lz, y, y’, A) — (Yi — yo Fy lz, y, y’, Al} de, 


Xi 


Xe 
= [vt votFM, Y, 0 — Fhe, yy’, MI ae. 
x; 
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The last term in J' is written involving F;, since along FE, Fy, = dF*. /dr. 
In the expression (4.8), Ag = g[X1, ¥(X1), X2, Y(X2)] — gla, y(a1), 22, y(za)]. 
Again using c as a generic constant, we have 


(4.9a) | Ag| S ce’, | J’| S ce’, 
Xe 
Pisef \¥-yll-ll¥’—y' lide 
Xi 


Xe 
ce’ | IY? — y’ | dz 
x 


< ce [ “(1 + wf] ¥’ — y’ IID) de 


1 


Xe 
(4.9b) < ce| (Xs TY il &*[z, Y, y’, 4; ¥] az. 
T Jx) 


In the above inequalities explicit values of the constants ¢c could be given in 
terms of the bounds of the functions F*, g, and certain of their partial deriva- 
tives. The third of the above inequalities for | J* | follows from the preceding 
by the easily established relation ¢ < 1 + Rift]. 

By the use of inequalities (4.9) it follows readily that for p a preassigned 
constant there exists an e’ > 0 such that if C is admissible for B and has its 
elements in (£),- , and if for C inequality (4.7) is satisfied, then J[C] — J[E] > 
3p. Thus the theorem is proved. 

As stated in the introduction, for non-parametric problems neither the result 
obtained by Lindeberg, nor any one of the extended forms of the Lindeberg 
theorem established by Levi and Tonelli, is effective in the proof of Theorem 2.2. 
Each of the referred to results gives an analogue of Theorem 4.2; it is the 
analogue of Theorem 4.1 that is incapable of proof by these forms of the Linde- 
berg theorem. For brevity, we shall not state explicitly any one of these 
results. For one familiar with these conditions, however, the following simple 
example illustrates why they are not applicable to the proof of a result analo- 
gous to that of Theorem 4.1. 

Consider the problem of minimizing the integral 


2 


(4.10) J= [ y"” dx 
0 


in the class of ares y = y(zx) satisfying the end-conditions 
(4.11) y(0) = 0 = y(2) 


and the isoperimetric condition 


2 
(4.12) [ (1 — z)y” dx = 0. 
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Clearly BE: y(x) = 0, > = 1,A =0(0 S gz S 2) isa non-singular extremal for 
this problem satisfying conditions Ij and IV* . We shall center our attention 
on comparison ares C: Y(x) of the following character: Y(2 — x) = Y(a) and 


Y(x) = mz, :&S 
—m'x+2m", Vis 
= 0, Qn" <2 31, (m > 2). 


Such an are C obviously satisfies (4.11) and (4.12). The defining function Y (x) 
is identically zero except on two intervals of total length 4/m’; moreover 
the maximum value of Y(z) is 1/m, and the length of the curve C is 
2+ 4/(1 + m')' — 1]/m*. For such an are C, however, the value of 


v@) = [a-owora = -fa-oyora 


. . . ;.3 a to 

is readily seen to be less than —2(1 — 2/m*)m on 2/m*® S x < 2 — 2/m’. In 
particular, for preassigned quantities « > 0, K > 0, there exists a value of m 
such that the length of C differs from that of HE by not more than ¢, whereas, 


U(l)| > K. 


5. Proof of Theorem 2.2. Theorem 2.2 is a ready consequence of the results 
of the last section. In the first place, it is to be noted that there exists an 
» > 0 such that if 


Y,(z), U.(z) = [ ‘hd, YO, ¥'O)dt (XS a SX) 


is an admissible are for 8 and its elements are in (@), , then this set is also in 
the neighborhood § of Theorem 2.1 and its end-values are in the neighborhood 
M of that theorem. Let p and e denote the constants of Theorem 4.1 corre- 
sponding to this value of n, and using this value of p, let «’ denote the constant 
of Theorem 4.2. Finally, let F be a neighborhood of £ in zy-space and 9h a 
neighborhood of the ends of E in [xm , ya, 22, Yi2|-space such that if C is an 
admissible are for B lying in ‘fF and with end-points in {W, then the elements 
of C lie in both of the neighborhoods (£), and (£),, of E. 

Now consider an admissible are C: Y (x) (X,; S x S X2) for B that lies in ‘fF 
and has its end-points in MM". If 


Xe 
(6.1) [ Sz, YO, ¥@,@; Y'@lae 

Xx} 
does not exceed p, then by Theorem 4.1 the corresponding are € is in (@),, 
hence in the § of Theorem 2.1, and has its end-points in the neighborhood M 
of that theorem. Consequently, for such an are C we have by Theorem 2.1 
that J[C] => J[E], the equality holding only if C = Z. On the other hand, if C 
is an admissible are for B that lies in ‘fF, has its end-points in ON, and renders 
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the integral (5.1) a value greater than p, it follows from Theorem 4.2 that 


J(C] > J[E]. 


6. Osgood theorems. In this section we shall prove an Osgood theorem for 
the problem B formulated in §2. This result is a consequence of an Osgood 
theorem for a general problem of Bolza not involving isoperimetric conditions, 
together with the Lindeberg Theorem 4.2. For a discussion of Osgood theorems 
for simpler problems of the caleulus of variations, particularly those in para- 
metric form, the reader is referred to Bolza [2]. 


THEOREM 6.1. Suppose that B is an ordinary problem of Bolza involving ex- 
pressions (2.2), (2.3) and (2.5), and that E: yi(x), > = 1, Aa(x) (v1 S TF S My) 
satisfies with constants e, the multiplier rule, Uy , non-singularity, and IV%. 
Then there exists a neighborhood F of E in xy-space and a neighborhood M of the 
ends of E in |x. , ya, X2, Yi2l-space with the following property: corresponding to 
each neighborhood F’ of E interior to F, and each neighborhood M’ of the ends of E 
interior to M, there exists a constant r > 0 such that for every admissible arc C inF 
and with end-points in M, but which does not lie in F’ and have end-points in M’, 
we have J(C] — J[E] 2 r. 

In the proof of this theorem use will be made of inequality (6.11) of Reid [10] 
from which the sufficiency theorem of that paper is deduced. Using the nota- 
tion of that paper, we have under the hypotheses of Theorem 6.1 that there exist 
bounded neighborhoods F and M, together with a constant rp > 0 such that if C: 
Y (x) (X; S x S X2) is an admissible arc for B lying in F and having end-points 


in M, then 


(6.1) JIC] — JB] 2 43 (X= 2)" + ACK IP + fH) Nae. 


In (6.1) we are using the convex function {¢] introduced in §3 above, instead of 
the function R{t] of Reid [10]; consequently, in (6.11) of Reid [10] the constants 
d, , dz would now be replaced by the corresponding constants d; , d, of Lemma 4.1 
of the present paper. In (6.1) above we have also written §[|| h’(x) ||], whereas 
the corresponding term in (6.11) of Reid [10] is R[|| v(x) ||]. In the notation of 
that paper, however, v,;(z) = u(x)hi(ax) and ‘}v || = mo |! h’ || on X:X2. Hence 
by property P,; of Rd], Rf\| v ||] S Rlmo || A’ ||] S mo min (1, mo) Rf}] A’ ||]. 
Finally, in terms of the constants appearing in equation (6.11) of Reid [10], the 
above constant 7>o may be defined as the least of the values x — &, xm - 
€o(My + 1 + 2de + di), [r — eod3(Qde + d;)]mo min (1, m). As pointed out at 
the end of §3 above, for such a problem B the expansion proof of Theorem 6.1 of 
Reid [10] is simplified if one uses a suitably chosen corresponding problem B*; 
if such a problem B* is used, the constants appearing in (6.11) of that paper are 
somewhat simplified. Since these simplifications are not of fundamental 
importance, however, we shall not trouble to present them here. 

Now suppose that F and M are such that for an admissible are C in F and 
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with end-points in M inequality (6.1) holds. Suppose, moreover, that F’ is a 
given neighborhood of E interior to F and M’ is a given neighborhood of the ends 
of E which is interior to M. Then there exists a 6 > 0 such that if C: Y,(z) 
(X, < x S Xz) is in (E);, then C is in F’ and has end-points in M’. Con- 
sequently, if C is an admissible are for B in F, and with end-points in M, but 
which does not lie in F’ and have end-points in M’, then C is not in (E), ; that is, 
either (X; — 2)° + (X2 — 22)” = & or there is a value — on X,X¢2 such that 

y(é) — y(é) || 2 6. If for such an are C we have (X; — 1)" + (X2 — 2) = 
§, it follows from (6.1) that J[C] — J[E] = rod’. Suppose, on the other hand, 
that there is a point — on X,X_ such that || Y(é) — y(é) || 2 6. Since on X,X2, 
Yr) — ys(x) = uy(r)h,(x), and || Y(x) — y(x) || S Mo || A(x) || , it then follows 
that || h(E) || 2 6/Mo. Now || A(é) || S || ACE) — ACN) || + || ACM) ||, and 


in view of the properties (3.5) and P» of Rt] we have 
HL} ACE) ||] S VWRL|| ACE) — ACK) fl] + 2 || ACK) | 
Moreover, 


E 
h(@) — h(X,) = [ h’(t) dt, 


t *e 
Ij h(t) — A(X) || < / || h’(t) || dt < i || h’(t) | \de. 
xX, Xi 
Consequently, 


Ri} h(E) — ACN) ||] SS a || h’(é) ae | 


Xo 
= (Xo — X)) max (1, Xe —_ x)si| yy / || h'(t) ae] 
Xe — X,/x, 


by property P: ; hence by Jensen’s inequality, 
63) {|| A(@) — A(X) ||] S max (1, Xe — XY) , RI|| A’) ||] de. 
For such an admissible arc C, therefore, 
|| A(X) |? + [ RI] h’(é) ||Jdt = 4 min (1, 1/(X2 — X1)) RI] ACH II], 


and by (6.1), J[C] — J[E] = 4ro min (1, 1/(X2 — X1)) R[6/My]. Since by 
hypothesis M is a bounded neighborhood of the ends of E, there is a constant 
6; > O such that for admissible ares C with ends in M we have min (1, 1/(X2 — 
X,)) = 6,. Theorem 6.1 is true, therefore, for r = min (rod", (70}51) R[6/Mo)). 

If for an ordinary problem B of Bolza without isoperimetric side conditions 
the neighborhoods F and M of an extremal E and its end-points satisfy the 
conditions of Theorem 6.1, these neighborhoods will, for brevity, be referred to as 
Osgood neighborhoods. Since the problem % of §2 is an ordinary problem of 
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Bolza, under the hypotheses of Theorem 2.1 there clearly exist Osgood neighbor- 
hoods of the extremal € for B. 


THEOREM 6.2. Suppose that the hypotheses of Theorem 2.1 are satisfied, and that 
§ and YM are Osgood neighborhoods of the extremal € for B. If F and Mi denote 
the neighborhoods for the corresponding extremal E for B as determined in the proof 
of Theorem 2.2 in §5, then F and 9M are Osgood neighborhoods for B. That is, 
corresponding to each neighborhood ‘f' of E interior to ‘¥ and each neighborhood 
OM’ of the end-points of E interior to M, there exists a constant r’ > O such that 
for every admissible arc C lying in and with end-points in QM, but which does not 
lie in F' and have end-points in MN’, we have J[C] — J[E] = r’. 


For suppose that ‘f’ is a neighborhood of F in zy-space which is interior to 
‘f, and 9’ is a neighborhood of the end-points of EZ interior to 9. Denote by 
e’’ a positive constant such that if C: Y,(z) (X; S 2 S X2) is an admissible are 
for B in (£)., then C is in ’ and its end-points are in MI’. Let §’ be a neigh- 
borhood of € in ryu-space interior to § and M?’ a neighborhood of the ends of € 
interior to Yt such that if € is an admissible are for B lying in §’ and having 
end-points in 22’, then € lies in (€),. As § and M are Osgood neighborhoods 
of ©, let r* denote the constant corresponding to §’ and M’ determined by 
Theorem 6.1. 

Now suppose that C is an admissible arc for B lying in ¢f and having end-points 
in OU, but which does not lie in fF’ and have its end-points in Mi’. Then C does 
not lie in (#),. , the corresponding admissible are € for 8 does not lie in (€)., 
and consequently € does not lie in §’ and have its end-points in M’. If for C 
the expression (5.1) does not exceed p, it then follows as in §5 that € is in § and 
has its end-points in 9. We then have J[C] — J[E] 2 r*. On the other hand, 
if the expression (5.1) exceeds p, it follows from Theorem 4.2 that J[C] — J[E] 2 
3p. Consequently, if C is an admissible are for B lying in ‘f and having its end- 
points in 9", but which does not lie in ’ and have its end-points in 9M’, we have 
J[C] — J[E] 2 r’, where r’ = min (r*, 4p). 
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THE CURVES OF A CONJUGATE NET 
By E. P. LANE anv M. L. MacQueen 


1. Introduction. The purpose of this paper is to make some contributions 
to the projective differential geometry of the curves of a conjugate net on an 
analytic surface in ordinary space. §2 contains a summary of portions of the 
theory of a surface referred to a conjugate net. Power series expansions in 
non-homogeneous projective coérdinates for the parametric curves on the sur- 
face are then computed to terms of the sixth degree. Some geometrical appli- 
cations of these series are next made in a discussion of quadric surfaces having 
contact of the second order at a point of the surface. In the last two sections 
conjugate nets with specialized families are considered. 


2. Analytic basis. In this section we indicate an analytic basis for the study 
of a surface referred to a conjugate net. 

If the projective homogeneous coérdinates x“’, -.. ,2 of a point P, in 
ordinary space are given as analytic functions of two independent variables u, 
by equations of the form 


(1) x = 2x(u, v), 
the locus of P, as u, v vary is an analytic surface S. If the parametric curves 
on the surface form a conjugate net, the four codérdinates x and the four coér- 
dinates y of a point on the axis of the point P, satisfy a completely integrable 
system of partial differential equations of the form! 

Lun = px + ary a Ly, 
(2) Tuy = CL + ary + OSs; 
qx + 6x, + Ny (LN # 0). 


ll 


Lov 


Let the point P, be the harmonic conjugate of the point P, with respect to 
the two foci of the axis. It is easy to verify that 


(3) Yu = fx — nx, + sx, + Ay, Yo = gx + tr, + nz, + By, 
where we have placed 


IN = cy + ac + bq — ch — Qu, gL = c, + be + ap — ca — pr, 


(4) —nN =a,+a@ — ai —4g, tL = a, + ab+c— aw, 
sN = b, +ab+c — 6,, nL =b, +b? — ba — p, 
A = b — (log N),, B =a — (log L),. 


Received May 16, 1939. 
'E. P. Lane, Projective Differential Geometry of Curves and Surfaces, Chicago, 1932, 
p. 138. 
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The ray-points, or Laplace transformed points z_, , x, , of the point P, are 
given by the formulas 
(5) rt. = 2 — bz, Y= % — az. 


The following formulas give some of the invariants of the parametric conju- 


gate net: 


H =c+ab-—-a,, K=c+ab—b,, 
§ = sN R = iL, 
(6) hie 
8B’ = 4a — 26 + (logr),, 8G’ = 4b — 2a — (logr)., 
D = —2nL, r= N/L. 


We shall employ the covariant tetrahedron z, z_; , 2: , y as a local tetrahedron 
of reference with a unit point chosen so that a point 


(7) X = wit + Yori + Ysti + yy 


has local coérdinates proportional to y;,--- , ys. In this coérdinate system 
the equations of the osculating planes of the parametric curves C, , C, at the 
point P, are respectively ys = 0, yo = 0. 


3. Power series expansions. An analytic curve in ordinary space can be 
defined by expressing two of the non-homogeneous coérdinates of a point on 
the curve as power series in the third coérdinate. Such power series expansions 
for the u-curve at a point P, of the surface (1) may be calculated in the fol- 
lowing way. 

The coérdinates X of a point near P, and on the u-curve through P, are 
expressible by Taylor’s expansion as power series in the increment Aw corre- 
sponding to displacement from P, to the point X along the u-curve, 


, 2 3 4 
(8) X = 7 + Lu Au + $2 uy,Au + Sunn AU + Sr Fuunn Au + _—~ @ 
Expressing each of tu. , Zuuu.,-:: a8 a linear combination of z, z,, 2%, y, we 
’ I< 
find that X can be expressed in the form (7), where the local coérdinates 
yi, --- , ys of the point X are given by the expansions 


yr: = 1 + bAu + 3(p + ba)Au® + As Au’ + gyAyAu‘ + ---, 
ye = Au + fadu® + Ua, + a’ + p — nb)Auw’ + xB, Au‘ 


+ reyBsAu° + cee, 
1 ‘ 1 


9) = — Au + es [a + 2b — 2l,, + (log ),JAu* 
1 A) 5 l A) Y 6 
+O F ee Oe 


ys = 4L Aw? + BL (a +b — LA’ + gy LD Aut + hel Ds Aw’ 
+ rtoLDeAu’ + ---, 
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wherein / is defined by 
l= logr 
and the coefficients A;, --- , Ds are defined by the following formulas: 
A; = (p. + ap + fL) + b(a, + @ + p — nL) + asl, 
Ay = (pu tap +fL)y + b(p. + ap +fL) + (p + ba)(au + a + p — nl) 
+ bla, + a + p — nL), + sL(2ab + c) + a(sL). 
+ (f + as — bn)(L, + al + AL), 
Be = (pu + ap + fL) + aay + a + p — nL) + (au + a + p — nl), 
+ asL — n(L, + al + AL), 
Bs = a(py + ap + fL) + 2p. + ap + fL)u + (au + a + p — nL)’ 
+ ala, + a’? + p — nL) + (au + @& + p — ML)uu + 2a(sL). 
+ sL(c + ab + a, + aa) + (f + as — nA — an — n,)(L. + aL + Al) 
— 2n(L. + aL + AL), , 
Cs = (a, + a° + p — nL) +0 + by + 2b(sL)u/sL + (sl) uu/sL 
+ (a+b—1,)(A +b + s/s) + AL, + aL + AL), /L, 
Co = (pu tap +fL) + (arta +p—nl)(a+b+A + 2L,/L + 8/9) 
+ 3(a. + a® + p — nL) + ash + b° + 3bb, + duu + 3(bu + b°)(sL)u/ah 
+ 3b(sL)wu/sL + (8h) uuu/sl + (a + b — ly)(bu + 0° + 2bs,/s 
(10) + Su./s + bA + 2A, + As,/s + A® — nL) 
+ 3(Ly. + al + AL)u(b + A + 8./s)/L + 3(Lu + ab + AL)wu/L, 
Dy = (a, +o + p — nL) + A(a +b — l,) + (Ly + aL + AL), /L, 
Ds = (pu + ap + fL) + (au + a + p — nL)(a + b — ly) + ash 
+ 2a. + a’ + p — nL), + 2A(Ly + aL + AL,/L 
+ (a + b — 1,)(A® + Ay — nL) + (Ly + aL + AL)w/L, 
De = (pu + ap + fL)(a + b — ly) + 3(pu + ap + fL). 
+ (a, + a’ + p — nL)(a’ + p+ aA + aly/L + 2a, + A* + As 
+ 2AL,/L + Lu/L — nb) + 3(a +b — ly)(au + ao + p — nL)y 
+ 3(a, + a? + p — mL)w + sL(c + aa + aA + aL,/L + 2a, + d) 
+ 3a(sL), + (a + b — L,)(fL — ank — 2nAL — 2n,L — nb, + ash 
+ A’ + 3AAy + Aw) + 3(Lu + al + AL),(A* + Ay — nL)/L 
+ 3A(Ly + aL + AL)wu/L + (Lu + aL + AL)uuu/L. 











(14 


Al) 
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Introducing non-homogeneous coérdinates by the definitions 


(11) t=H/mn, Yruy/yw, 2=y/m, 
we find, by use of (9), the following expansions: 
a = Au + 4(a — 2b)Auw’ + 3[(a — 2b), + (a — 2b)? + nLlAu' 
+34[B, — 4A; — 6(a — 2b)(p + ba) — 4b(a — 2b), — 4b(a — 2b)’ 
— 4bnLJAu‘ + y45{Bs — 5Ay — 10(a — 2b) As — 10(p + ba)[(a — 2b), 
+ (a — 2b)’ + nL] — 5b[B, — 4A3 — 6(p + ba) (a — 2b) — 4b(a — 2b), 
— 4b(a — 2b)” — 4bnL]}Au’ + ---, 


l > ‘ > = 
_ ore +5 L Die — 2b — 2h, + (log S)uldu! 
+ x 8 ? 10, — 10(p + ba) — 5bla — 2b — 21, + (log H).u)}Au’ 


(2) + = © 10, — 6bCs — 20As + 15(28? — p — ba)la — 2b — 2k, 
+ (log S).] + 60(p + ba)}Au* + ---, 
z = 4LAu* + tL(a — 2b — 1) Au’ 
+ sL[D, — 6(p + ba) — 4b(a — 2b — 1,)JAu' 
+ xteL[Ds — 56D, — 10A3 + 10(a — 2b — 1,)(2b° — p — ba) 
+ 30b(p + ba)|Au’ + rh5L[Ds — 6bDs — 15(p + ba) Dy 
+ 30b7D, — 15A, + 60bA; — 20(a — 2b — 1,) As 
— 120b(a — 2b — 1,)(b* — p — ba) + 90(p + ba)’ 
— 180b°(p + ba)JAu® + --. 


Writing y and z as power series in z with undetermined coefficients and de- 
manding that these series be satisfied by the series (12) identically in Au as 
far as the terms of the sixth order, we obtain the following power series expan- 
sions for the u-curve at a point P, of a conjugate net: 


-19,4 1 9 QD os 1D 6 
(13) y=a2 t ZF (I + 166’)z* +; 0 7 et + 5 -agx + , 


z= 312? + 4LC'2* + gy Lz + Poe + rhoLber® + ---, 
where the coefficients a5, --- , bs are defined by the formulas 
by = 4(20, + Cl, + 246”), 
ds = 3b + 2407 + 31° + J, 
(14) bs = baw + (16C€’ + lb, — 8C’D + GLP, 
dy = 4bs + 61d, + E+ Ju + Jl, — FID + 2LP, 
bs = bon + (60C’ + $lu)bs — 8/1886’ + 5h)be — 406’ + 10 =<, 
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in which J, J, E, and P are defined by placing 
I = (log Du + 4€’ + 3, 
J=I1, — 3° + 41, +4C€I+9, 
E = 31° + 41J + 40C'J + 4807, 
P =f + as — bn. 


(15) 


II 


Analogous expansions for the v-curve at the point P, can be written by 
making the appropriate symmetrical interchanges of the symbols. 


4. A canonical form for the expansions. In this section it will be shown 
that by suitable choice of the coérdinate system the power series expansions 
(13) can be reduced to an especially simple canonical form 

The parametric equations of the osculating twisted cubic at the point P, of 
the u-curve can be written in the form 


1+ 31t+ Ct + Dé, 


= 

yo =t+Gt + Fe, 
(16) > 1S; 

ws = 6 ry - 

ys = 4Lf, 


where we have placed 
G = }I — $C, 
F = go + HH? — 4b, — $C’ + 23°C"), 
D = gobs — FC'by + IF + 4344C 
C = Yo + U1 + the — $07 - 496°). 


(17) 


The osculating twisted cubic (16) may also be represented by means of two power 
series expansions which must agree with the series (13) up to and including the 
terms in z’. For this result we find 

If ;, 1 $ 9 


H 
= a ( 6 ‘S der? iar 
as)” 6r~ + og 7 + 1662" + i30 raz” + a0 eens 


z= ALa” + pal -+ Lb, x* + rtobbsz° + rhoLbx° + Soo, 


where 
ds = 4b; + 61b, + E + 41, 

be = (31 + 56C’)bs + 4805 + gol3(4J + °) — 123207 — 35,8400” ]by 
— xho(4J + 1°)? + 9O'(4J + 1°)(31 — 400’) 
— 326°(1? — 5606’7 — 11,200 ©”). 











yer 
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Let us make the transformation 


X+FY+GZ 


* 1+ YX + DY + C2’ 
6(O/r)Y 
‘ = : 
(20) ym 1 + HX + DY + C2’ 
L 
2 $LZ 


~ 1+ 3X + DY 4+ CZ 


from the old coérdinates zx, y, z to new codrdinates X, Y, Z. This transforma- 
tion moves the vertex (0, 0, 0, 1) of the local tetrahedron of reference to a point 


on the osculating cubic of the u-curve. Moreover, the edge X,; = X2 = 0 of 
the new tetrahedron is tangent to the osculating cubic at the point (0, 0, 0, 1), 
and the face X,; = 0 is the osculating plane of the cubic at this point. The 
unit point is on the osculating cubic whose equations become 

(21) Y=X, Z=xX’. 


Thus we find that the power series expansions for the u-curve at the point P, 
can be written in the canonical form 


Y = X* + AX’ +..., 
Z = X* + BX’ +... 


in which the coefficients Ag , Be are defined by the formulas 


(22) 


Ag = réo(as — Ge), 


(23) . 
Bs = sea (be — bs). 

5. Applications to quadrics having second-order contact with the surface. 
The power series expansions deduced in the preceding sections will now be 
employed to investigate certain configurations which are covariantly associated 
with a point of a conjugate net of a surface. 

First of all, the equation of any quadric surface having contact of the second 
order with the surface at the point P, can be written in the form 


(24) La’ + Ny’ — 2z + kere + kyyz + ky’ = 0, 


where ky , ks , ky are arbitrary. By means of equations (13) and the analogous 
expansions for the v-curve, it is easy to show that the quadrics (24) for which 


(25) ke = 780’, ky = APR’ 

are the quadrics having contact of the third order with both the u-curve and 
the v-curve at the point P,. Thus we find that any quadric of the pencil 

(26) La* + Ny? — 2z + 48C'rz + 42B’yz + yz’ = 0, 

where ky is arbitrary, has contact of the third order with the parametric curves at a 
point P, of the surface. If a unique quadric of this pencil is desired, we may 
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choose the one that passes through the covariant point P,. For this quadric 


we have ky = 0. Incidentally, it may be remarked that the unique quadric 
for which kz = ks = ky = O in equation (24) has been called* the canonical 


quadric of the parametric conjugate net. The axis and ray are polar reciprocal 
lines with respect to this quadric which passes through the point P, . 

Among the quadrics (24) there is a pencil having contact of the fourth order 
with the u-curve (13) at the point P,. For this pencil we find 


(27) ky = . C, k= - (6. + 3C'l, + 86”), 


with k; arbitrary. It is of interest to observe that the quadric of Moutard for 
the u-tangent’ is a unique quadric of this pencil, and for this quadric we have 


ks = 0. Finally, there is a unique quadric of this pencil having fifth-order 
contact with the u-curve at the point P, , and for this quadric we find 
(28) 2a = 5b — 86’b, + = c". 


The equations of the analogous quadrics associated with the v-curve at the 
point P, can be written without difficulty. 

Lane has shown‘ that the codrdinates of the principal points of Bompiani’ 
of the two curves of the parametric conjugate net at a point P, of a surface are 


(29) ria t+ $C’x + (x, + $B’z)r*. 

These points are evidently on the associate conjugate tangents 
(30) Le’ — Ny =0, 2z=0, 

and on the principal join whose equations are 

(31) 8(C’x + B’y) = 3, z = 0. 


A geometric characterization of the principal points of the parametric curves 
at the point P, is contained in the following theorem, the truth of which is 
easy to verify. 

At a point P, of a conjugate net the principal point on each associate conjugate 
tangent is the pole of the plane determined by the axis and the other associate conjw 
gate tangent with respect to any quadric of the pencil (26). 

Moreover, it is then obvious that the principal join of the fundamental pare 


2 W. M. Davis, Contributions to the theory of conjugate nets, Chicago doctoral disserta 
tion (1932), p. 10. 

?M. L. MacQueen, On the principal join of two curves on a surface, American Journal of 
Mathematics, vol. 58(1936), p. 624. 

‘E. P. Lane, /nvariants of intersection of two curves on a surface, American Journal of 
Mathematics, vol. 54(1932), pp. 699-706. 

‘E. Bompiani, /nvarianti d’intersezione di due curve sghembe, Rendiconti dei Lineei, 
(6), vol. 14(1931), pp. 456-461. 
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metric curves at a point P, of a surface is the polar line of the axis with respect to 
any quadric of the pencil (26). 

It is easy to verify that the polar line of the ray with respect to any quadric 
of the pencil (26) joins the point P, to the point 


_ 8C'r, - 8B'x, 


(82) 3L 3N 


+ y, 


. ° ° 6 P ° 
and thus lies in the canonical plane’ whose equation is 
(33) B’r — rC'y = 0. 


Furthermore, the polar jine of the associate axis with respect to any quadric 
of the pencil (26) is found to have the equations 


(34) 2(C’x + B’y) = 3, z= 0, 
and so this line passes through the canonical point 

(35) B’r., — Cn 

of Davis. 


6. Conditions for twisted cubics. Consideration of the canonical expansions 
(22) leads to some interesting results. In the first place, application of the 
projective differential theory of curves in ordinary space shows that the u-curves 
belong to linear complexes in case Ag = 0. By means of equations (14), (19), 
this condition can be reduced to 


(36) Jut Jl, — J — 40D + 2LP = 0. 


Thus we easily have the theorem essentially the same as a theorem formerly’ 
obtained by Lane using a more laborious process: 

A necessary and sufficient condition that the u-curves of a conjugate net in 
ordinary space belong to linear complexes is given by equation (36). Further 
application of the theory of curves shows that the u-curves are twisted cubies 
incase Ag = By = 0. The condition Bs = 0 can be written 


bs. — $U — Ll, + 240’)bs — gol3(4J + 1°) + 784), + 392C'1, — 123267 
(37) — 9152C” |b, + 785(4J + 1°)? — 20’'(47 + 1°)(37 — 406’) 
HO 


r 


+ 426°(1° — 560C’T — 11,200€”) — 320€”D + 240C’LP + 10 = 0. 
Consequently the following theorem is established: 

Necessary and sufficient conditions that the u-curves of a conjugate net be twisted 
cubics are given by equations (36), (37). 


*W. M. Davis, loc. cit., p. 17. 
'E. P. Lane, Contributions to the theory of conjugate nets, American Journal of Mathe- 
matics, vol. 49(1927), p. 574. 
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7. Segre-Darboux net. The theory of the preceding sections will now be 
applied to some special cases which will be considered in this and the following 


section. 
It is known that the u-curves of a conjugate net are curves of Darboux and 
the v-curves are the corresponding curves of Segre in case @’ = 0. In this 


section such a net is called a Segre-Darboux net. The expansions (13), when 
specialized for the u-curve at a point P, of a Segre-Darboux net, become 


1D 3; 1D,4 1 %( 3 ‘) . 1 ‘ 
"ér the + I » 
(38) 7 j7* *H, * J+5 t+ so ee TF , 
> AL” + Jol’ Px’ + rhgLbex’ oa ree 
where 


ag = Jy + Jl, + 32° + 41J — 4D] + WLP, 
(39) HS 
7 


’ 


be = OLPl, + 6(LP), + 10 


and the definitions of J and J reduce to 
(40) I = (log Su t+3u, J=1.+ Hl, —- +9. 


Moreover, the expansions (18) for the osculating cubic at the point P, of the 
u-curve become 


(41) y 6r 24 
z= Ba? + gol? Pr’ + rhoLbex’ + -:-, 
wherein 
dy = 31° + $1J + 24LP, 


(42) . a3 
be = OLPI — y$o(4J + T°)’. 


Parametric equations of the osculating cubic (41) are given by 
yi = 1+ Ht t+ So(BJ + 70 + rhol(l’ + 41 + 16LP)¢, 
t+ Ul + go(4J + Pe, 


io 
(43) 1 
= 2 f 
6r 
ys = 4Lt’. 


It is known that through a non-asymptotic tangent at a point of a surface 
there are ordinarily two planes which produce sextactic sections of the surface. 
Performing the requisite calculations, we find that the equations of the two 
aforementioned planes which pass through a non-specialized u-tangent at 4 
point of the surface are given by 


(44) y¥ = pz (i = 1, 2), 
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where p: , p2 are the roots of the equation 

(45) LNG'p? — 3L(H — ©)p — dubs + C's — 3426" = 0. 

For the u-tangent at a point P, of a curve of a Segre-Darboux net, the equa- 
tion of one of the planes (44) is found to be 


(46) 5(H — S)y + 3Pz = 0, 


and the other plane is the tangent plane, z = 0, at the point P, of the surface. 
If H # §, the plane (46) coincides with the osculating plane at the point P, 
of the u-curve in case P = 0. Thus we find that through a tangent at a point 
P, of a u-curve of a parametric Segre-Darboux net the plane, except the tangent 
plane, intersecting the surface in a curve with a sextactic point at P, coincides 
with the osculating plane of the curve at the point if, and only if, P = 0. 

It can be shown from the results of §5 that the quadric of Moutard for the 
tangent at a point P, of a u-curve of a parametric Segre-Darboux net coincides 
with the canonical quadric whose equation is 


(47) La’ + Ny’? — 22 = 0. 


This quadric is intersected by the osculating cubic (43) in six points, five of 
which coincide at the point P,. The point of intersection distinct from the 
point P, is the point for which 

2 
(48) E © + on + ry = 18LP. 

r 320 
Thus we arrive at the following result: 

At a point P, of a u-curve of a parametric Segre-Darboux net the osculating 
cubic of the curve intersects the quadric of Moutard for the tangent at the point of 
the curve in points which coincide at P, if, and only if, P = 0. 

Finally, it will be observed that the u-curves of a Segre-Darbourx net belong to 
linear complexes in case 


(49) Jut dl, — iJ — 4O1 + 2LP = 0, 
wherein I, J are defined by equations (40). Moreover, these curves are twisted 
cubics if to equation (49) is adjoined the condition 


‘ y HE 
(50) * (LP). ~ LPU - t) + l (4 + 1)? + 1OHS = 0. 


160 9 
8. Nets with one family plane curves. In this section we again specialize 
the general theory by supposing that the u-curves are plane curves. It is 
known that the u-curves of a conjugate net in ordinary space are plane curves 
in case § = 0. Therefore the expansions (13), when specialized for a plane 
u-curve at the point P, , become 


y = 0, 


(51) | | 
z= ym + sLC'x* + gy Lb" + reoLlbsx” + rhoLbex® + eee, 











702 E. P. LANE AND M. L. MACQUEEN 


where by, 6; are defined by equations (14) and the definition of bs reduces to 
(52) bp = bsu + (606' + 3lu)bs — 40C’bu, — 40C'l.d, — 15046, . 


The equations of the conics having contact of the third order with the curve 
(51) at the point P, are easily found to be 


z — 4a’ — §$C’'rz + hz’ = 0, 


(53) 
y = 0, 


where h is arbitrary. For the osculating conic the parameter h has the value 


(54) Lh = 1286” — 4b,. 


Let us substitute the power series (51) in the equation of the osculating conic 
of the curve and demand that this equation be satisfied identically in z up to 
and including the terms in zx’. The curve is then hyperosculated by its oscu- 
lating conic, so that if this happens at every point of the curve, then the curve 
is a conic. Thus we prove the theorem: 

If the u-curves are plane curves, they are conics if, and only if, 


(55) 1286” — 18(2C.,. + Clu. + 36.1, + CL — 26’D) — 27LP = 0. 


The polar line of the ray-point z_, with respect to any conic of the pencil (53) 
has the equations 


3Lz + 8C’z = 0, 
(56) 
y = 0. 
This line evidently coincides with the axis at the point P, in case ©’ = 0, so 
that the curve (51) is then a plane curve of Darboux. Therefore a u-curve 
which is a plane curve is a curve of Darboux if, and only if, the ray-point x_, is the 
pole of the axis with respect to any conic of the pencil having contact of the third 
order with the curve at the point P, . 
It follows from equations (51) that if the u-curves are plane curves of Darbouz, 
the expansions for the curve at the point P, are 


y = 0, 
LDa? + Dy? Px’ + ghoL[3LPl, + 2(LP).)2° + ---. 


Zz 


x, P ° 8 
It may be remarked that Cech has investigated’ the surfaces whose curves of 
Darboux are plane using, however, asymptotic parameters instead of conjugate 
parameters. 


* Fubini and Cech, Geometria Proiettiva Differenziale, Bologna, 1926, vol. 1, p. 170. 
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If P # 0, the equations of the eight-point nodal cubic at the point P, of the 
curve (57) are found to be 


2(L? Px ~ dbz) = 4L*° Px® — pglbex’z + 2LP*2’, 


Il 


(58) 
y = 0, 


where 
be = 3LPl, + 2(LP), . 
The nodal tangents at the point P, are the u-tangent, 
y=z= 0, 
and the projective normal of the curve whose equations are 
6L’Px — bez = 0, 
(59) 
y = 0. 
Thus we find that at a point of a u-curve which is a plane curve of Darboux the 
projective normal of the curve coincides with the axis of the net if, and only tf, 
(60) 3LPl, + 2(LP), = 0. 
It is known that the codrdinates of a variable point on an analytic plane 
curve satisfy an ordinary differential equation of the form 
(61) x’ + 3px” + 3px’ + psx = 0, 


where accents denote differentiation with respect to the parameter along the 
curve. Two invariants 6; , 43 of this differential equation are given by 
8 / 
6; = P3; — $P2, 
(62) 
” = a2 ~_ 2 
63 = 66303 = 703 —_ 27 P263 ’ 
where P: , P; are defined by the formulas 
2 , 
P, = po — Pi — Pi; 
(63) 3 7 
P; = ps — 3p~ip2 + 2pi — pr- 
It is furthermore known that the integral curves of equation (61) are conics 
in case 6; = 0, and are coincidence curves in case 63 = 0. 
If the u-curves are plane curves of Darboux, on calculating the differential 
equation of the form (61) for these curves and making use of equations (62), (63), 
one finds that the invariants 63 , 63 are given by 


05 = — LP, 


(64) : : : 
0; = 6LP(LP) wu — 7(LP). + OLP)*(luw + 3 + D). 
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It follows from equation (55) or from the form of the expansions (57), as well ag 
from the vanishing of 6; in the first of equations (64), that if the u-curves are plang 
curves of Darboux, they are conics if, and only 7, 


In this case the expansions for the curve at the point P, are 
y = 0, 

(65) . 

z Lx. 

Finally, if P + 0, equating to zero the expression for 63 in the second of equation 

(64) yields the necessary and sufficient condition that a u-curve which is a pla 


curve of Darboux be a coincidence curve. 
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